Subject: Re: Slow MR time and high network utilization with all local data
From: Robert Dyer
To: Harsh J
Reply-To: user@hadoop.apache.org
Date: Mon, 25 Feb 2013 01:41:26 -0600

I am using Ganglia.

Note I have short-circuit reads enabled (I think; I never verified it was
working, but I do get errors if I run jobs as another user).

Also, if Ganglia's network-use figures included the local socket, then I
would see network utilization in all cases. I see no utilization when using
HBase as the MR input along with the MapFile. I also see a small amount
when using HBase for both (as one would expect).

On Mon, Feb 25, 2013 at 1:22 AM, Harsh J wrote:
> Hi Robert,
>
> How are you measuring the network usage? Note that unless short-circuit
> reading is on, data reads are done over a local socket as well, and may
> appear in network-traffic-observing tools too (but that does not mean
> they went over the network).
>
> On Mon, Feb 25, 2013 at 2:35 AM, Robert Dyer wrote:
> > I have a small 6-node dev cluster. I use a 1 GB SequenceFile as input
> > to a MapReduce job, using a custom split size of 10 MB (to increase
> > the number of maps). Each map call will read random entries out of a
> > shared MapFile (that is around 50 GB).
> >
> > I set replication to 6 on both of these files, so all of the data
> > should be local for each map task. I verified via fsck that no blocks
> > are under-replicated.
> >
> > Despite this, for some reason the MR job maxes out the network and
> > takes an extremely long time. What could be causing this?
> >
> > Note that the total number of map outputs for this job is around 400,
> > and the reducer just passes the values through, so there shouldn't be
> > much network utilized by the output.
> >
> > As an experiment, I switched from the SeqFile input to an HBase table
> > and now see almost no network used. I also tried leaving the SeqFile
> > as input and switching the MapFile to an HBase table, and see about
> > 30% network used (which makes sense, as now that 50 GB of data isn't
> > always local).
> >
> > What is going on here? How can I debug to see what data is being
> > transferred over the network?
>
> --
> Harsh J

--
Robert Dyer
rdyer@iastate.edu
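For readers of the archive: the short-circuit reads discussed above are enabled in hdfs-site.xml. A minimal sketch, assuming the Hadoop 2.x domain-socket implementation (the socket path below is illustrative; 2013-era 1.x clusters instead used the older `dfs.block.local-path-access.user` property):

```xml
<!-- hdfs-site.xml: enable short-circuit local reads (Hadoop 2.x style).
     The domain socket path is illustrative and must be creatable by the
     DataNode on every node. -->
<property>
  <name>dfs.client.read.shortcircuit</name>
  <value>true</value>
</property>
<property>
  <name>dfs.domain.socket.path</name>
  <value>/var/lib/hadoop-hdfs/dn_socket</value>
</property>
```

When short-circuit reads are actually working, local block reads bypass the DataNode socket entirely, so they show up in neither loopback nor NIC counters.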
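One way to answer the thread's closing question ("how can I debug to see what data is being transferred over the network?") on Linux is to compare per-interface byte counters from `/proc/net/dev`: traffic over the loopback interface (`lo`) is local-socket I/O, while traffic on `eth0` (or similar) actually crossed the wire. A minimal sketch, not part of the original thread; interface names are illustrative:

```python
def parse_net_dev(text):
    """Parse /proc/net/dev-format text into {iface: (rx_bytes, tx_bytes)}.

    In that format, each interface line is 'iface: f0 f1 ... f15' where
    field 0 is received bytes and field 8 is transmitted bytes.
    """
    stats = {}
    for line in text.splitlines():
        if ":" not in line:
            continue  # skip the two header lines
        iface, fields = line.split(":", 1)
        parts = fields.split()
        stats[iface.strip()] = (int(parts[0]), int(parts[8]))
    return stats


def snapshot():
    """Read current counters from /proc/net/dev (Linux only)."""
    with open("/proc/net/dev") as f:
        return parse_net_dev(f.read())
```

Sampling `snapshot()` before and after a job and diffing the counters shows how many bytes moved per interface; a large delta on `lo` but not `eth0` would confirm Harsh's point that reads went over a local socket rather than the network.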