Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 36003 invoked from network); 13 Apr 2007 16:29:59 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 13 Apr 2007 16:29:59 -0000 Received: (qmail 16827 invoked by uid 500); 13 Apr 2007 16:30:04 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 16802 invoked by uid 500); 13 Apr 2007 16:30:04 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 16793 invoked by uid 99); 13 Apr 2007 16:30:04 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Apr 2007 09:30:04 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (herse.apache.org: local policy) Received: from [206.18.177.52] (HELO alnrmhc12.comcast.net) (206.18.177.52) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Apr 2007 09:29:57 -0700 Received: from [192.168.168.15] (c-71-202-24-246.hsd1.ca.comcast.net[71.202.24.246]) by comcast.net (alnrmhc12) with ESMTP id <20070413162936b1200b1hs2e>; Fri, 13 Apr 2007 16:29:36 +0000 Message-ID: <461FAFEE.3040207@apache.org> Date: Fri, 13 Apr 2007 09:29:34 -0700 From: Doug Cutting User-Agent: Thunderbird 1.5.0.10 (X11/20070306) MIME-Version: 1.0 To: hadoop-dev@lucene.apache.org Subject: Re: [jira] Commented: (HADOOP-1214) the first step for streaming clean up References: <985608.1176412455491.JavaMail.jira@brutus> <2f6eb616632ea9aa7935b6c1dc02afdf@yahoo-inc.com> <461EBE2E.6080208@apache.org> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Arkady Borkovsky wrote: >> On Apr 12, 2007, at 4:18 PM, Doug Cutting wrote: >> The new classes in question are not a part of streaming, but are being >> added to the mapred package. > Is not Hadoop Streaming part of Hadoop MapReduce product? Streaming is currently in contrib not in the core. I'm just suggesting we use consistent, accurate and descriptive terminology within the core. These classes to not read nor generate lines. They do facilitate interoperability with other line-based tools like TextInputFormat and TextOutputFormat. > And are not the classes in question supposed to be referred to by "naive > users" on the Hadoop Streaming command line? I don't think we should name core classes to make the streaming command line more intuitive. If all else were equal, sure, that's a good thing, but, core classes should be named as consistently, accurately and descriptively as possible. If streaming's command line is confusing, then that should be fixed in streaming, no? Doug