hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <cutt...@apache.org>
Subject Re: [jira] Commented: (HADOOP-1214) the first step for streaming clean up
Date Fri, 13 Apr 2007 16:29:34 GMT
Arkady Borkovsky wrote:
>> On Apr 12, 2007, at 4:18 PM, Doug Cutting wrote:
>> The new classes in question are not a part of streaming, but are being 
>> added to the mapred package.
> Is not Hadoop Streaming part of Hadoop MapReduce product?

Streaming is currently in contrib not in the core.

I'm just suggesting we use consistent, accurate and descriptive 
terminology within the core.  These classes to not read nor generate 
lines.  They do facilitate interoperability with other line-based tools 
like TextInputFormat and TextOutputFormat.

> And are not the classes in question supposed to be referred to by "naive 
> users" on the Hadoop Streaming command line?

I don't think we should name core classes to make the streaming command 
line more intuitive.  If all else were equal, sure, that's a good thing, 
but, core classes should be named as consistently, accurately and 
descriptively as possible.  If streaming's command line is confusing, 
then that should be fixed in streaming, no?


View raw message