From: Arun C Murthy <acm@hortonworks.com>
Subject: Re: How Yarn execute MRv1 job?
Date: Wed, 19 Jun 2013 23:59:31 -0700
To: user@hadoop.apache.org

I'd use Hive 0.11.

On Jun 19, 2013, at 11:56 PM, sam liu wrote:

> Hi Azuryy,
>
> So, older versions of HBase and Hive, like HBase 0.94.0 and Hive 0.9.0, do not support Hadoop 2.x, right?
>
> Thanks!
>
>
> 2013/6/20 Azuryy Yu
> Hi Sam,
> Please look at: http://hbase.apache.org/book.html#d2617e499
>
> Generally, when we say YARN we mean Hadoop 2.x; you can download hadoop-2.0.4-alpha, and Hive 0.10 supports Hadoop 2.x very well.
>
>
> On Thu, Jun 20, 2013 at 2:11 PM, sam liu wrote:
> Thanks Arun!
>
> #1, Yes, I did tests and found that MRv1 jobs can run against YARN directly, without recompiling.
>
> #2, Do you mean the old versions of HBase/Hive cannot run against YARN, and only certain versions of them can? If yes, how can I find the versions that work with YARN?
>
>
> 2013/6/20 Arun C Murthy
>
> On Jun 19, 2013, at 6:45 PM, sam liu wrote:
>
>> Thanks for the detailed answers! Here are three further questions:
>>
>> - YARN maintains backwards compatibility, and an MRv1 job can run on YARN. If YARN does not require any code change to existing MRv1 jobs, why would we need to recompile them?
>
> You don't need to recompile MRv1 jobs to run against YARN.
>
>> - Which YARN jar files are required for the recompiling?
>> - In a cluster with Hadoop 1.1.1 and other Hadoop-related components (HBase 0.94.3, Hive 0.9.0, ZooKeeper 3.4.5, ...), if we want to replace Hadoop 1.1.1 with YARN, do we need to recompile all the other Hadoop-related components against the YARN jar files? Without any code change?
>
> You will need versions of HBase, Hive etc. which are integrated with hadoop-2.x, but you do not need to change any of your end-user applications (MR jobs, Hive queries, Pig scripts etc.)
>
> Arun
>
>>
>> Thanks in advance!
>>
>>
>> 2013/6/19 Rahul Bhattacharjee
>> Thanks Arun and Devaraj, good to know.
>>
>>
>> On Wed, Jun 19, 2013 at 11:24 AM, Arun C Murthy wrote:
>> Not true, the CapacityScheduler has support for both CPU & memory now.
>>
>> On Jun 18, 2013, at 10:41 PM, Rahul Bhattacharjee wrote:
>>
>>> Hi Devaraj,
>>>
>>> As for the resource request for a YARN container, currently only memory is considered as a resource, not CPU. Please correct me if I'm wrong.
>>>
>>> Thanks,
>>> Rahul
>>>
>>>
>>> On Wed, Jun 19, 2013 at 11:05 AM, Devaraj k wrote:
>>> Hi Sam,
>>>
>>> Please find the answers to your queries.
>>>
>>> >- YARN can run multiple kinds of jobs (MR, MPI, ...), but an MRv1 job has a special execution process (map > shuffle > reduce) in Hadoop 1.x. How does YARN execute an MRv1 job? Does it still include the special MR steps of Hadoop 1.x, like map, sort, merge, combine and shuffle?
>>>
>>> In YARN, the central concept is the application. An MR job is one kind of application, which makes use of the MRAppMaster (i.e. the ApplicationMaster for that application). If we want to run different kinds of applications, we need an ApplicationMaster for each kind of application.
>>>
>>> >- Do the MRv1 parameters still work for YARN? Like mapreduce.task.io.sort.mb and mapreduce.map.sort.spill.percent?
>>>
>>> These configurations still work for an MR job in YARN.
>>>
>>> >- What's the general process for the ApplicationMaster of YARN to execute a job?
>>>
>>> The MRAppMaster (ApplicationMaster for an MR job) manages the job life cycle, which includes getting the containers for maps & reducers, launching the containers via the NM, tracking the task status until completion, and managing failed tasks.
>>>
>>> >2. In Hadoop 1.x, we can set the map/reduce slots by setting 'mapred.tasktracker.map.tasks.maximum' and 'mapred.tasktracker.reduce.tasks.maximum'
>>> >- For YARN, the above two parameters do not work any more, as YARN uses containers instead, right?
>>>
>>> Correct, these params don't work in YARN. In YARN it is completely based on the resources (memory, CPU). The ApplicationMaster can request resources from the RM to complete the tasks for that application.
>>>
>>> >- For YARN, we can set the whole physical memory for a NodeManager using 'yarn.nodemanager.resource.memory-mb'. But how do we set the default size of physical memory for a container?
>>>
>>> The ApplicationMaster is responsible for getting the containers from the RM by sending resource requests.
For an MR job, you can use the "mapreduce.map.memory.mb" and "mapreduce.reduce.memory.mb" configurations to specify the map & reduce container memory sizes.
>>>
>>> >- How to set the maximum size of physical memory of a container? By the parameter 'mapred.child.java.opts'?
>>>
>>> It can be set based on the resources requested for that container.
>>>
>>> Thanks
>>>
>>> Devaraj K
>>>
>>> From: sam liu [mailto:samliuhadoop@gmail.com]
>>> Sent: 19 June 2013 08:16
>>> To: user@hadoop.apache.org
>>> Subject: How Yarn execute MRv1 job?
>>>
>>> Hi,
>>>
>>> 1. In Hadoop 1.x, a job is executed by map tasks and reduce tasks together, with a typical process (map > shuffle > reduce). In YARN, as I understand it, an MRv1 job is executed only by the ApplicationMaster.
>>> - YARN can run multiple kinds of jobs (MR, MPI, ...), but an MRv1 job has a special execution process (map > shuffle > reduce) in Hadoop 1.x. How does YARN execute an MRv1 job? Does it still include the special MR steps of Hadoop 1.x, like map, sort, merge, combine and shuffle?
>>> - Do the MRv1 parameters still work for YARN? Like mapreduce.task.io.sort.mb and mapreduce.map.sort.spill.percent?
>>> - What's the general process for the ApplicationMaster of YARN to execute a job?
>>>
>>> 2. In Hadoop 1.x, we can set the map/reduce slots by setting 'mapred.tasktracker.map.tasks.maximum' and 'mapred.tasktracker.reduce.tasks.maximum'
>>> - For YARN, the above two parameters do not work any more, as YARN uses containers instead, right?
>>> - For YARN, we can set the whole physical memory for a NodeManager using 'yarn.nodemanager.resource.memory-mb'. But how do we set the default size of physical memory for a container?
>>> - How to set the maximum size of physical memory of a container? By the parameter 'mapred.child.java.opts'?
>>>
>>> Thanks!
>>>
>>>
>>
>> --
>> Arun C. Murthy
>> Hortonworks Inc.
>> http://hortonworks.com/
>
> --
> Arun C. Murthy
> Hortonworks Inc.
> http://hortonworks.com/

--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/
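The container-memory and framework settings discussed in this thread can be collected into a minimal mapred-site.xml sketch for Hadoop 2.x. The property names are the real Hadoop 2.x keys mentioned above; the values are illustrative assumptions, not tuning recommendations:

```xml
<!-- mapred-site.xml (sketch): run existing MRv1 job jars unchanged on YARN -->
<configuration>
  <!-- Submit jobs to YARN instead of the classic/local framework;
       no recompilation of MRv1 jobs is needed. -->
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>

  <!-- Per-container memory for map and reduce tasks (illustrative values) -->
  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>1024</value>
  </property>
  <property>
    <name>mapreduce.reduce.memory.mb</name>
    <value>2048</value>
  </property>

  <!-- The JVM heap must fit inside the container allocation above -->
  <property>
    <name>mapreduce.map.java.opts</name>
    <value>-Xmx800m</value>
  </property>
  <property>
    <name>mapreduce.reduce.java.opts</name>
    <value>-Xmx1600m</value>
  </property>
</configuration>
```

With this in place, an existing MRv1 jar can be submitted unchanged, e.g. `hadoop jar myjob.jar ...` (the jar name is hypothetical). Note that the old 'mapred.child.java.opts' still works as a deprecated fallback, but the per-task `mapreduce.{map,reduce}.java.opts` variants above are preferred in Hadoop 2.x.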

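The NodeManager-level and scheduler-level memory limits that Devaraj refers to live in yarn-site.xml; a minimal sketch follows. The property names are the actual Hadoop 2.x keys; the values are assumptions for illustration:

```xml
<!-- yarn-site.xml (sketch): node and container memory bounds -->
<configuration>
  <!-- Total physical memory the NodeManager may hand out to containers -->
  <property>
    <name>yarn.nodemanager.resource.memory-mb</name>
    <value>8192</value>
  </property>

  <!-- Smallest and largest container the scheduler will grant;
       requests are normalized up to a multiple of the minimum -->
  <property>
    <name>yarn.scheduler.minimum-allocation-mb</name>
    <value>1024</value>
  </property>
  <property>
    <name>yarn.scheduler.maximum-allocation-mb</name>
    <value>8192</value>
  </property>
</configuration>
```

With these example values, one 8192 MB NodeManager could run at most eight 1024 MB map containers, or four 2048 MB reduce containers, at a time.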