From: Harsh J
Date: Thu, 14 Apr 2011 13:22:38 +0530
Subject: Re: Using MultipleTextOutputFormat for map-only jobs
To: mapreduce-user@hadoop.apache.org

Hello Hari,

On Thu, Apr 14, 2011 at 11:09 AM,
Hari Sreekumar wrote:
> Hi,
> I have a map-only mapreduce job where I want to deduce the output filename
> from the output key/value. I figured MultipleTextOutputFormat is the best
> fit for my purpose. But I am unable to use it in map-only jobs. I was able
> to run it if I add a reduce phase. But when I use map-only jobs, the file
> gets written to the usual part-0000xx files. Also, is there no support for
> this output format in v0.20.2? I mean, is it necessary to use the deprecated
> classes if I want to use this?
> Thanks,
> Hari

The MultipleOutputFormat class is not available in the new, unstable API.
Its functionality has been replaced there by the MultipleOutputs class,
which does much the same thing. However, the new-API MultipleOutputs is
not part of Apache's Hadoop 0.20.2 release either [1]. Using the stable
(old) API is still recommended: it is no longer marked deprecated in
0.20.3, and 0.21 also supports it.

That said, MultipleOutputFormat should still work for map-only jobs, as
described in two of its use cases [2]. Could you give us some details of
your code setup for using this?

[1] - It is available as part of 0.21.0, though, or in Cloudera's
Distribution including Apache Hadoop 0.20.2.
[2] - http://hadoop.apache.org/common/docs/r0.20.2/api/org/apache/hadoop/mapred/lib/MultipleOutputFormat.html

-- 
Harsh J