Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: mapreduce-user@hadoop.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
From: Robert Evans <evans@yahoo-inc.com>
To: "mapreduce-user@hadoop.apache.org" <mapreduce-user@hadoop.apache.org>
Date: Fri, 15 Apr 2011 13:20:14 -0700
Subject: Re: successive mappers
Thread-Topic: successive mappers
Thread-Index: Acv7oEun4GLY/zQ2QSWkXO9cT16OcwACjt1u
Message-ID: <C9CE12AE.21E19%evans@yahoo-inc.com>
In-Reply-To: <898077.11584.qm@web19206.mail.hk2.yahoo.com>
Accept-Language: en-US
Content-Language: en
acceptlanguage: en-US
Content-Type: multipart/alternative;
	boundary="_000_C9CE12AE21E19evansyahooinccom_"
MIME-Version: 1.0

--_000_C9CE12AE21E19evansyahooinccom_
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

I,

Take a look at the Multiple output format classes

http://hadoop.apache.org/common/docs/r0.20.0/api/org/apache/hadoop/mapred/l=
ib/MultipleTextOutputFormat.html

Is a good example.  You should be able to create a custom output format cla=
ss that matches your needs.  Although, if all you are doing is map processi=
ng then why are you outputting intermediate results instead of processing t=
hem all in a single mapper?  It should be a lot faster if you don't need th=
e intermediate results.

--Bobby Evans

On 4/15/11 2:05 PM, "Injun Joe" <ll_oz_ll@yahoo.com.hk> wrote:

Hi,
I am coding a map-reduce program which involves several map-reduce steps. T=
he work that my program does is only in the mapper, so I was thinking to ha=
ve no reduce steps but successive mappers. The logic can be written like th=
is for mappers at iteration 0 and 1:

1. Take input.
2. Map 0:
   Determine if a key-value pair satisfies condition C.
    - If it satisfies condition then output the key-value pair to a file in=
 directory E.
    - If it does not then transform key-value pair and output the key-value=
 pair to directory D.
3. Map 1:
   - Change input directory to directory D
   - Perform same steps as map 0.

So, the problem is that I have not been able to find a way to output key-va=
lue pairs to different directories. All I have been able to specify is the =
map output directory by TextOutputFormat.setOutputPath.

Any help would be appreciated.

Thanks a lot
I


--_000_C9CE12AE21E19evansyahooinccom_
Content-Type: text/html; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<HTML>
<HEAD>
<TITLE>Re: successive mappers</TITLE>
</HEAD>
<BODY>
<FONT FACE=3D"Calibri, Verdana, Helvetica, Arial"><SPAN STYLE=3D'font-size:=
11pt'>I,<BR>
<BR>
Take a look at the Multiple output format classes<BR>
<BR>
<a href=3D"http://hadoop.apache.org/common/docs/r0.20.0/api/org/apache/hado=
op/mapred/lib/MultipleTextOutputFormat.html">http://hadoop.apache.org/commo=
n/docs/r0.20.0/api/org/apache/hadoop/mapred/lib/MultipleTextOutputFormat.ht=
ml</a><BR>
<BR>
Is a good example. &nbsp;You should be able to create a custom output forma=
t class that matches your needs. &nbsp;Although, if all you are doing is ma=
p processing then why are you outputting intermediate results instead of pr=
ocessing them all in a single mapper? &nbsp;It should be a lot faster if yo=
u don&#8217;t need the intermediate results.<BR>
<BR>
--Bobby Evans<BR>
<BR>
On 4/15/11 2:05 PM, &quot;Injun Joe&quot; &lt;<a href=3D"ll_oz_ll@yahoo.com=
.hk">ll_oz_ll@yahoo.com.hk</a>&gt; wrote:<BR>
<BR>
</SPAN></FONT><BLOCKQUOTE><FONT FACE=3D"Calibri, Verdana, Helvetica, Arial"=
><SPAN STYLE=3D'font-size:11pt'>Hi,<BR>
I am coding a map-reduce program which involves several map-reduce steps. T=
he work that my program does is only in the mapper, so I was thinking to ha=
ve no reduce steps but successive mappers. The logic can be written like th=
is for mappers at iteration 0 and 1:<BR>
<BR>
1. Take input.<BR>
2. Map 0:<BR>
&nbsp;&nbsp;&nbsp;Determine if a key-value pair satisfies condition C.<BR>
&nbsp;&nbsp;&nbsp;&nbsp;- If it satisfies condition then output the key-val=
ue pair to a file in directory E.<BR>
&nbsp;&nbsp;&nbsp;&nbsp;- If it does not then transform key-value pair and =
output the key-value pair to directory D.<BR>
3. Map 1:<BR>
&nbsp;&nbsp;&nbsp;- Change input directory to directory D<BR>
&nbsp;&nbsp;&nbsp;- Perform same steps as map 0.<BR>
<BR>
So, the problem is that I have not been able to find a way to output key-va=
lue pairs to different directories. All I have been able to specify is the =
map output directory by TextOutputFormat.setOutputPath.<BR>
<BR>
Any help would be appreciated.<BR>
<BR>
Thanks a lot<BR>
I<BR>
<BR>
<BR>
</SPAN></FONT></BLOCKQUOTE>
</BODY>
</HTML>


--_000_C9CE12AE21E19evansyahooinccom_--