Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (athena.apache.org: domain of
 nagarjuna.kanamarlapudi@gmail.com designates 209.85.220.42 as permitted
 sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAMOW4COjH4mMT6bJKBDMPZi4V0p4n3RccC+8PM5ZBR4Bq71Z8A@mail.gmail.com>
References: 
 <CA+Zwj9_y2Y_U7q2VGFqiugW0+mEtWb_KP-imMKxraWMOxMt04Q@mail.gmail.com>
	<CA+Zwj98f631oLR042_h6FK41D6+nGTq-xT0i1U2FSY83zCFb0A@mail.gmail.com>
	<CAMOW4COjH4mMT6bJKBDMPZi4V0p4n3RccC+8PM5ZBR4Bq71Z8A@mail.gmail.com>
Date: Tue, 7 Jan 2014 00:37:10 +0530
Message-ID: 
 <CA+Zwj98RbOE9-Cs5qsxyyeKjeH2tW9KTtOFns+01qdS72fCvxA@mail.gmail.com>
Subject: Re: Understanding MapReduce source code : Flush operations
From: nagarjuna kanamarlapudi <nagarjuna.kanamarlapudi@gmail.com>
To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Content-Type: multipart/alternative; boundary=bcaec520f2e381793504ef51f74d

--bcaec520f2e381793504ef51f74d
Content-Type: text/plain; charset=ISO-8859-1

I want to look at the code for the flush operations that happen after the
reduce phase.

The Reducer writes its output to the OutputFormat, which in turn buffers it
in memory; once the buffer reaches 90% of the chunk size, it starts to flush
the reducer output.
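
The behaviour described above can be sketched roughly as follows. This is
only an illustration of a threshold-triggered flush, not Hadoop's actual
code: the class name ThresholdBuffer, its methods, the 100-byte chunk and
the 0.9 fraction are all assumptions made for the example.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;

// Hypothetical illustration only; this is NOT a Hadoop class.
public class ThresholdBuffer {
    private final byte[] chunk;        // in-memory chunk that records are copied into
    private final int flushThreshold;  // number of buffered bytes that triggers a flush
    private final OutputStream sink;   // stands in for the real output destination
    private int used = 0;              // bytes currently buffered
    private int flushCount = 0;        // number of flushes performed so far

    public ThresholdBuffer(int chunkSize, double flushFraction, OutputStream sink) {
        if (chunkSize <= 0 || flushFraction <= 0.0 || flushFraction > 1.0) {
            throw new IllegalArgumentException("chunkSize must be > 0 and 0 < flushFraction <= 1");
        }
        this.chunk = new byte[chunkSize];
        this.flushThreshold = (int) (chunkSize * flushFraction);
        this.sink = sink;
    }

    // Copy a record into the chunk, flushing whenever the threshold is reached.
    public void write(byte[] record) throws IOException {
        int offset = 0;
        while (offset < record.length) {
            int n = Math.min(record.length - offset, chunk.length - used);
            System.arraycopy(record, offset, chunk, used, n);
            used += n;
            offset += n;
            if (used >= flushThreshold) {
                flush();
            }
        }
    }

    // Push whatever is buffered to the sink and reset the chunk.
    public void flush() throws IOException {
        if (used == 0) {
            return;
        }
        sink.write(chunk, 0, used);
        sink.flush();
        used = 0;
        flushCount++;
    }

    public int getFlushCount() { return flushCount; }

    public int getBuffered() { return used; }

    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        ThresholdBuffer buf = new ThresholdBuffer(100, 0.9, sink);
        for (int i = 0; i < 10; i++) {
            buf.write(new byte[10]);   // ten 10-byte records
        }
        buf.flush();                   // final flush of the remainder
        System.out.println("flushes=" + buf.getFlushCount() + " bytes=" + sink.size());
    }
}
```

In this sketch the flush is triggered by the write that crosses the
threshold, and a final explicit flush() drains whatever is left.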

I essentially want to look at the code of that flushing operation.


Which class(es) do I need to look into?


On Mon, Jan 6, 2014 at 11:23 PM, Hardik Pandya <smarty.juice@gmail.com> wrote:

> Please do not tell me that in the last 2.5 years you have not used a virtual
> Hadoop environment to debug your MapReduce application before deploying it to
> a production environment.
>
> No one can stop you from looking at the code; Hadoop and its ecosystem are
> open source.
>
>
> On Mon, Jan 6, 2014 at 9:35 AM, nagarjuna kanamarlapudi <
> nagarjuna.kanamarlapudi@gmail.com> wrote:
>
>>
>>
>> ---------- Forwarded message ----------
>> From: nagarjuna kanamarlapudi <nagarjuna.kanamarlapudi@gmail.com>
>>  Date: Mon, Jan 6, 2014 at 6:39 PM
>> Subject: Understanding MapReduce source code : Flush operations
>> To: mapreduce-user@hadoop.apache.org
>>
>>
>>  Hi,
>>
>> I have been using Hadoop/MapReduce for about 2.5 years. I want to understand
>> the internals of the Hadoop source code.
>>
>> Let me state my requirement clearly.
>>
>> I want to look at the code for the flush operations that happen after the
>> reduce phase.
>>
>> The Reducer writes its output to the OutputFormat, which in turn buffers it
>> in memory; once the buffer reaches 90% of the chunk size, it starts to flush
>> the reducer output.
>>
>> I essentially want to look at the code of that flushing operation.
>>
>>
>>
>>
>> Regards,
>> Nagarjuna K
>>
>>
>

--bcaec520f2e381793504ef51f74d--