From: sam liu <samliuhadoop@gmail.com>
To: user@hadoop.apache.org
Date: Tue, 18 Jun 2013 16:58:56 +0800
Subject: Re: Why my tests shows Yarn is worse than MRv1 for terasort?

Hi Harsh,

Thanks for your detailed response! The efficiency of my Yarn cluster improved a lot after I increased the reducer number (mapreduce.job.reduces) in mapred-site.xml.
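For context, that change boils down to a single property; the value below is only an illustrative placeholder (MapReduce otherwise defaults to a single reduce task per job), so it should be chosen from the cluster's actual reduce capacity:

  <!-- mapred-site.xml: cluster-wide default number of reduce tasks per job -->
  <property>
    <name>mapreduce.job.reduces</name>
    <value>12</value>  <!-- placeholder: roughly the number of reduce containers the cluster can run at once -->
  </property>

The same setting can also be passed per job, e.g. -D mapreduce.job.reduces=12, for jobs that honour the generic options.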
But I still have some questions about the way Yarn executes an MRv1 job:

1. In Hadoop 1.x, a job is executed by map tasks and reduce tasks together, with a typical process (map > shuffle > reduce). In Yarn, as I understand it, an MRv1 job is executed only by the ApplicationMaster.
- Yarn can run multiple kinds of jobs (MR, MPI, ...), but an MRv1 job has a special execution process (map > shuffle > reduce) in Hadoop 1.x. How does Yarn execute an MRv1 job? Does it still include the special MR steps of Hadoop 1.x, like map, sort, merge, combine and shuffle?
- Do the MRv1 parameters still work for Yarn, like mapreduce.task.io.sort.mb and mapreduce.map.sort.spill.percent?
- What is the general process for the ApplicationMaster of Yarn to execute a job?

2. In Hadoop 1.x, we can set the map/reduce slots by setting 'mapred.tasktracker.map.tasks.maximum' and 'mapred.tasktracker.reduce.tasks.maximum'.
- For Yarn, the above two parameters no longer work, as Yarn uses containers instead, right?
- For Yarn, we can set the total physical memory of a NodeManager using 'yarn.nodemanager.resource.memory-mb'. But how do we set the default size of physical memory of a container?
- How do we set the maximum size of physical memory of a container? With the parameter 'mapred.child.java.opts'?

Thanks as always!

2013/6/9 Harsh J <harsh@cloudera.com>
> Hi Sam,
>
> > - How to know the container number? Why you say it will be 22 containers
> > due to a 22 GB memory?
>
> The MR2's default configuration requests 1 GB resource each for Map
> and Reduce containers. It requests 1.5 GB for the AM container that
> runs the job, additionally. This is tunable using the properties
> Sandy's mentioned in his post.
>
> > - My machine has 32 GB memory, how much memory is proper to be assigned
> > to containers?
>
> This is a general question. You may use the same process you took to
> decide the optimal number of slots in MR1 to decide this here. Every
> container is a new JVM, and you're limited by the CPUs you have there
> (if not the memory). Either increase memory requests from jobs, to
> lower the # of concurrent containers at a given time (runtime change), or
> lower the NM's published memory resources to control the same (config
> change).
>
> > - In mapred-site.xml, if I set 'mapreduce.framework.name' to be 'yarn',
> > will other parameters for mapred-site.xml still work in the yarn framework?
> > Like 'mapreduce.task.io.sort.mb' and 'mapreduce.map.sort.spill.percent'
>
> Yes, all of these properties will still work. Old properties specific
> to the JobTracker or TaskTracker (usually found as a keyword in the config
> name) will not apply anymore.
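To make those knobs concrete (an illustrative sketch using the standard Hadoop 2.x property names and their documented defaults, not values taken from this cluster): the per-task container requests are mapreduce.map.memory.mb and mapreduce.reduce.memory.mb (1024 MB each by default), the AM request is yarn.app.mapreduce.am.resource.mb (1536 MB by default), and the scheduler clamps any single request between the limits set in yarn-site.xml:

  <!-- yarn-site.xml: lower and upper bounds on any single container request -->
  <property>
    <name>yarn.scheduler.minimum-allocation-mb</name>
    <value>1024</value>
  </property>
  <property>
    <name>yarn.scheduler.maximum-allocation-mb</name>
    <value>8192</value>
  </property>

So the "default size" of a task container comes from the mapreduce.*.memory.mb requests, and the effective maximum is capped by yarn.scheduler.maximum-allocation-mb rather than by mapred.child.java.opts, which only sizes the JVM heap inside the container.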
>
> On Sun, Jun 9, 2013 at 2:21 PM, sam liu <samliuhadoop@gmail.com> wrote:
> > Hi Harsh,
> >
> > According to the above suggestions, I removed the duplicated setting, and
> > reduced the values of 'yarn.nodemanager.resource.cpu-cores',
> > 'yarn.nodemanager.vcores-pcores-ratio' and
> > 'yarn.nodemanager.resource.memory-mb' to 16, 8 and 12000. And then, the
> > efficiency improved by about 18%. I have questions:
> >
> > - How to know the container number? Why you say it will be 22 containers
> > due to a 22 GB memory?
> > - My machine has 32 GB memory, how much memory is proper to be assigned
> > to containers?
> > - In mapred-site.xml, if I set 'mapreduce.framework.name' to be 'yarn',
> > will other parameters for mapred-site.xml still work in the yarn framework?
> > Like 'mapreduce.task.io.sort.mb' and 'mapreduce.map.sort.spill.percent'
> >
> > Thanks!
> >
> >
> > 2013/6/8 Harsh J <harsh@cloudera.com>
> >>
> >> Hey Sam,
> >>
> >> Did you get a chance to retry with Sandy's suggestions? The config
> >> appears to be asking NMs to use roughly 22 total containers (as
> >> opposed to 12 total tasks in the MR1 config) due to a 22 GB memory
> >> resource. This could impact much, given the CPU is still the same for
> >> both test runs.
> >>
> >> On Fri, Jun 7, 2013 at 12:23 PM, Sandy Ryza <sandy.ryza@cloudera.com>
> >> wrote:
> >> > Hey Sam,
> >> >
> >> > Thanks for sharing your results. I'm definitely curious about what's
> >> > causing the difference.
> >> >
> >> > A couple observations:
> >> > It looks like you've got yarn.nodemanager.resource.memory-mb in there
> >> > twice with two different values.
> >> >
> >> > Your max JVM memory of 1000 MB is (dangerously?) close to the default
> >> > mapreduce.map/reduce.memory.mb of 1024 MB. Are any of your tasks
> >> > getting killed for running over resource limits?
> >> >
> >> > -Sandy
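One way to act on Sandy's observation (a sketch with placeholder numbers, not a recommendation made in the thread) is to keep the task JVM heap comfortably below the container request, so the JVM's non-heap and native memory does not push the process over the limit YARN enforces:

  <!-- mapred-site.xml: container request vs. JVM heap, leaving headroom -->
  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>1024</value>
  </property>
  <property>
    <name>mapreduce.map.java.opts</name>
    <value>-Xmx800m</value>  <!-- heap around 75-80% of the container request -->
  </property>
  <property>
    <name>mapreduce.reduce.memory.mb</name>
    <value>1024</value>
  </property>
  <property>
    <name>mapreduce.reduce.java.opts</name>
    <value>-Xmx800m</value>
  </property>

Raising mapreduce.map/reduce.memory.mb instead (and leaving -Xmx1000m as is) buys the same headroom, at the cost of fewer concurrent containers.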
> >> >
> >> > On Thu, Jun 6, 2013 at 10:21 PM, sam liu <samliuhadoop@gmail.com> wrote:
> >> >>
> >> >> The terasort execution log shows that the reduce spent about 5.5 minutes
> >> >> going from 33% to 35%, as below.
> >> >> 13/06/10 08:02:22 INFO mapreduce.Job:  map 100% reduce 31%
> >> >> 13/06/10 08:02:25 INFO mapreduce.Job:  map 100% reduce 32%
> >> >> 13/06/10 08:02:46 INFO mapreduce.Job:  map 100% reduce 33%
> >> >> 13/06/10 08:08:16 INFO mapreduce.Job:  map 100% reduce 35%
> >> >> 13/06/10 08:08:19 INFO mapreduce.Job:  map 100% reduce 40%
> >> >> 13/06/10 08:08:22 INFO mapreduce.Job:  map 100% reduce 43%
> >> >>
> >> >> Anyway, below are my configurations for your reference. Thanks!
> >> >> (A) core-site.xml
> >> >> only defines 'fs.default.name' and 'hadoop.tmp.dir'
> >> >>
> >> >> (B) hdfs-site.xml
> >> >>   <property>
> >> >>     <name>dfs.replication</name>
> >> >>     <value>1</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>dfs.name.dir</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/dfs_name_dir</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>dfs.data.dir</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/dfs_data_dir</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>dfs.block.size</name>
> >> >>     <value>134217728</value><!-- 128MB -->
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>dfs.namenode.handler.count</name>
> >> >>     <value>64</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>dfs.datanode.handler.count</name>
> >> >>     <value>10</value>
> >> >>   </property>
> >> >>
> >> >> (C) mapred-site.xml
> >> >>   <property>
> >> >>     <name>mapreduce.cluster.temp.dir</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/mapreduce_temp</value>
> >> >>     <description>No description</description>
> >> >>     <final>true</final>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>mapreduce.cluster.local.dir</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/mapreduce_local_dir</value>
> >> >>     <description>No description</description>
> >> >>     <final>true</final>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>mapreduce.child.java.opts</name>
> >> >>     <value>-Xmx1000m</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>mapreduce.framework.name</name>
> >> >>     <value>yarn</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>mapreduce.tasktracker.map.tasks.maximum</name>
> >> >>     <value>8</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>mapreduce.tasktracker.reduce.tasks.maximum</name>
> >> >>     <value>4</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>mapreduce.tasktracker.outofband.heartbeat</name>
> >> >>     <value>true</value>
> >> >>   </property>
> >> >>
> >> >> (D) yarn-site.xml
> >> >>   <property>
> >> >>     <name>yarn.resourcemanager.resource-tracker.address</name>
> >> >>     <value>node1:18025</value>
> >> >>     <description>host is the hostname of the resource manager and
> >> >>     port is the port on which the NodeManagers contact the Resource Manager.</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <description>The address of the RM web application.</description>
> >> >>     <name>yarn.resourcemanager.webapp.address</name>
> >> >>     <value>node1:18088</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.resourcemanager.scheduler.address</name>
> >> >>     <value>node1:18030</value>
> >> >>     <description>host is the hostname of the resourcemanager and port is
> >> >>     the port on which the Applications in the cluster talk to the Resource Manager.</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.resourcemanager.address</name>
> >> >>     <value>node1:18040</value>
> >> >>     <description>the host is the hostname of the ResourceManager and the
> >> >>     port is the port on which the clients can talk to the Resource Manager.</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.local-dirs</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/yarn_nm_local_dir</value>
> >> >>     <description>the local directories used by the nodemanager</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.address</name>
> >> >>     <value>0.0.0.0:18050</value>
> >> >>     <description>the nodemanagers bind to this port</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.resource.memory-mb</name>
> >> >>     <value>10240</value>
> >> >>     <description>the amount of memory on the NodeManager in GB</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.remote-app-log-dir</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/yarn_nm_app-logs</value>
> >> >>     <description>directory on hdfs where the application logs are moved to</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.log-dirs</name>
> >> >>     <value>/opt/hadoop-2.0.4-alpha/temp/hadoop/yarn_nm_log</value>
> >> >>     <description>the directories used by Nodemanagers as log directories</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.aux-services</name>
> >> >>     <value>mapreduce.shuffle</value>
> >> >>     <description>shuffle service that needs to be set for Map Reduce to run</description>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.resourcemanager.client.thread-count</name>
> >> >>     <value>64</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.resource.cpu-cores</name>
> >> >>     <value>24</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.vcores-pcores-ratio</name>
> >> >>     <value>3</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.resource.memory-mb</name>
> >> >>     <value>22000</value>
> >> >>   </property>
> >> >>   <property>
> >> >>     <name>yarn.nodemanager.vmem-pmem-ratio</name>
> >> >>     <value>2.1</value>
> >> >>   </property>
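A rough reading of the yarn-site.xml above (an editorial back-of-the-envelope, not a calculation stated verbatim in the thread):

  22000 MB (yarn.nodemanager.resource.memory-mb) / 1024 MB (default per map/reduce request)
    = about 21 task containers per NodeManager
  + 1 ApplicationMaster container (1.5 GB request)
    = roughly the 22 total containers Harsh mentions

Note also that yarn.nodemanager.resource.memory-mb appears twice (10240 and 22000), which is the duplication Sandy points out; only one of the two values can take effect, so the duplicate entry should be removed.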
> >> >>
> >> >> 2013/6/7 Harsh J <harsh@cloudera.com>
> >> >>>
> >> >>> Not tuning configurations at all is wrong. YARN uses memory resource
> >> >>> based scheduling and hence MR2 would be requesting 1 GB minimum by
> >> >>> default, causing, on base configs, to max out at 8 (due to the 8 GB NM
> >> >>> memory resource config) total containers. Do share your configs, as at
> >> >>> this point none of us can tell what it is.
> >> >>>
> >> >>> Obviously, it isn't our goal to make MR2 slower for users and to not
> >> >>> care about such things :)
> >> >>>
> >> >>> On Fri, Jun 7, 2013 at 8:45 AM, sam liu <samliuhadoop@gmail.com> wrote:
> >> >>> > At the beginning, I just wanted to do a fast comparison of MRv1 and
> >> >>> > Yarn. But they have many differences, and to be fair for the
> >> >>> > comparison I did not tune their configurations at all. So I got the
> >> >>> > above test results. After analyzing the test results, no doubt, I
> >> >>> > will configure them and do the comparison again.
> >> >>> >
> >> >>> > Do you have any idea on the current test result? I think, compared
> >> >>> > with MRv1, Yarn is better in the Map phase (teragen test), but worse
> >> >>> > in the Reduce phase (terasort test).
> >> >>> > Are there any detailed suggestions/comments/materials on Yarn
> >> >>> > performance tuning?
> >> >>> >
> >> >>> > Thanks!
> >> >>> >
> >> >>> >
> >> >>> > 2013/6/7 Marcos Luis Ortiz Valmaseda <marcosluis2186@gmail.com>
> >> >>> >>
> >> >>> >> Why not tune the configurations?
> >> >>> >> Both frameworks have many areas to tune:
> >> >>> >> - Combiners, Shuffle optimization, Block size, etc
> >> >>> >>
> >> >>> >>
> >> >>> >> 2013/6/6 sam liu <samliuhadoop@gmail.com>
> >> >>> >>>
> >> >>> >>> Hi Experts,
> >> >>> >>>
> >> >>> >>> We are thinking about whether to use Yarn or not in the near
> >> >>> >>> future, and I ran teragen/terasort on Yarn and MRv1 for comparison.
> >> >>> >>>
> >> >>> >>> My env is a three-node cluster, and each node has similar hardware:
> >> >>> >>> 2 CPUs (4 cores), 32 GB mem. Both the Yarn and MRv1 clusters are
> >> >>> >>> set up on the same env. To be fair, I did not make any performance
> >> >>> >>> tuning on their configurations, but used the default configuration
> >> >>> >>> values.
> >> >>> >>>
> >> >>> >>> Before testing, I thought Yarn would be much better than MRv1, if
> >> >>> >>> they all use the default configuration, because Yarn is a better
> >> >>> >>> framework than MRv1. However, the test result shows some differences:
> >> >>> >>>
> >> >>> >>> MRv1: Hadoop-1.1.1
> >> >>> >>> Yarn: Hadoop-2.0.4
> >> >>> >>>
> >> >>> >>> (A) Teragen: generate 10 GB data:
> >> >>> >>> - MRv1: 193 sec
> >> >>> >>> - Yarn: 69 sec
> >> >>> >>> Yarn is 2.8 times better than MRv1
> >> >>> >>>
> >> >>> >>> (B) Terasort: sort 10 GB data:
> >> >>> >>> - MRv1: 451 sec
> >> >>> >>> - Yarn: 1136 sec
> >> >>> >>> Yarn is 2.5 times worse than MRv1
> >> >>> >>>
> >> >>> >>> After a fast analysis, I think the direct cause might be that Yarn
> >> >>> >>> is much faster than MRv1 in the Map phase, but much worse in the
> >> >>> >>> Reduce phase.
> >> >>> >>>
> >> >>> >>> Here I have two questions:
> >> >>> >>> - Why do my tests show Yarn is worse than MRv1 for terasort?
> >> >>> >>> - What is the strategy for tuning Yarn performance? Are there any
> >> >>> >>> materials?
> >> >>> >>>
> >> >>> >>> Thanks!
> >> >>> >>
> >> >>> >>
> >> >>> >> --
> >> >>> >> Marcos Ortiz Valmaseda
> >> >>> >> Product Manager at PDVSA
> >> >>> >> http://about.me/marcosortiz
> >> >>> >
> >> >>>
> >> >>> --
> >> >>> Harsh J
> >> >>
> >> >
> >>
> >> --
> >> Harsh J
> >
>
> --
> Harsh J