Subject: Re: OutputFormat and Reduce Task
From: Dhruv <dhruv21@gmail.com>
To: user@hadoop.apache.org
Date: Fri, 2 Nov 2012 10:35:19 -0700

Thanks Harsh, just to be clear: if I have a large key set and I run with just one reducer (the default), the OutputFormat and the RecordWriter will be constructed only once?

On Thu, Nov 1, 2012 at 8:14 PM, Harsh J wrote:

> Hi Dhruv,
>
> Inline.
>
> On Fri, Nov 2, 2012 at 4:15 AM, Dhruv wrote:
> > I'm trying to optimize the performance of my OutputFormat's
> > implementation. I'm doing something similar to HBase's
> > TableOutputFormat: sending the reducer's output to a distributed k-v
> > store, so the context.write() call basically winds up doing a Put()
> > on the store.
> >
> > Although I haven't profiled, a sequence of thread dumps on the reduce
> > tasks reveals that the threads are RUNNABLE and sitting in put() and
> > its subsequent method calls. So I proceeded to decouple the two by
> > implementing the producer (context.write()) / consumer
> > (RecordWriter.write()) pattern using an ExecutorService.
>
> With HBase involved, this is only partly correct. The HTable API, which
> the regular TableOutputFormat uses, provides an "AutoFlush" option
> which, if disabled, buffers writes to the regionservers instead of
> flushing Puts/Deletes on every single invocation.
>
> TableOutputFormat disables AutoFlush by default to provide this
> behavior.
>
> Read more on that at
> http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/HTable.html#setAutoFlush(boolean,%20boolean)
> and/or in Lars' book, "HBase: The Definitive Guide".
>
> > My understanding is that Context.write() calls RecordWriter.write(),
> > and that these two are synchronous calls: the first blocks until the
> > second completes. Each reduce call blocks until context.write()
> > finishes, so the reduce on the next key also blocks, making things
> > run slowly in my case. Is this correct?
>
> Given the above explanation, this is untrue if HBase's
> TableOutputFormat is involved, but true otherwise for general
> FS-interacting OutputFormats.
>
> > Does this mean that the OutputFormat is instantiated once by the
> > TaskTracker for the job's reduce logic, and all keys operated on by
> > the reducers get the same instance of the OutputFormat? Or is a new
> > OutputFormat instantiated for each key the reducer operates on?
>
> The TaskTracker is a service daemon that does not execute any user
> code. Only a single OutputFormat object is instantiated in a single
> Task. The RecordWriter wrapped in it, too, is instantiated only once
> per Task.
>
> > Thanks,
> > Dhruv
>
> --
> Harsh J
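The producer/consumer decoupling discussed in the thread (the reduce thread enqueues records while a single background thread performs the slow store writes) can be sketched in plain Java. This is a minimal, self-contained illustration under stated assumptions: the class name, queue capacity, and the in-memory `store` list standing in for the distributed k-v store are all illustrative, not Hadoop or HBase API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.LinkedBlockingQueue;

class AsyncWriterSketch {
    private static final String POISON = "";          // sentinel that stops the consumer
    private final BlockingQueue<String> queue = new LinkedBlockingQueue<>(1024);
    private final ExecutorService executor = Executors.newSingleThreadExecutor();
    private final List<String> store = new ArrayList<>(); // stand-in for the slow k-v store
    private final Future<?> drain;

    AsyncWriterSketch() {
        // Consumer: drains the queue and performs the slow writes, so the
        // reducer thread never waits on the store directly.
        drain = executor.submit(() -> {
            try {
                while (true) {
                    String rec = queue.take();
                    if (rec.equals(POISON)) break;
                    store.add(rec);                    // a real impl would call the store's put()
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
    }

    // Producer side, i.e. what RecordWriter.write() would do: a cheap enqueue
    // that blocks only when the buffer is full (back-pressure).
    void write(String record) throws InterruptedException {
        queue.put(record);
    }

    // What RecordWriter.close() would do: flush everything before the task ends.
    List<String> close() throws Exception {
        queue.put(POISON);
        drain.get();                                   // wait for the consumer to finish
        executor.shutdown();
        return store;
    }
}
```

The sentinel in close() guarantees every buffered record reaches the store before the task reports success; as Harsh points out, HBase's client-side write buffer achieves a similar batching effect without a second thread.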
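For reference, the buffering Harsh describes looks roughly like the fragment below with the HTable client API of that era (per the setAutoFlush(boolean, boolean) javadoc linked above). This is an illustrative fragment, not a runnable program: the table name and buffer size are assumptions, TableOutputFormat does the equivalent internally, and later HBase releases moved this responsibility to BufferedMutator.

```
HTable table = new HTable(conf, "my_table");   // "my_table" is illustrative
table.setAutoFlush(false, false);              // buffer Puts client-side instead of flushing per call
table.setWriteBufferSize(2 * 1024 * 1024);     // flush roughly every 2 MB of buffered Puts
// ... table.put(put) calls now accumulate in the client-side write buffer ...
table.flushCommits();                          // push any remaining buffered mutations
table.close();
```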