Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of ojoshi@hortonworks.com
 designates 209.85.215.43 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <3129260.Jz0lzkYlD7@p854801>
References: 
 <5cd8d054fde44601ab5f0a709ac99f6e@BY2PR09MB062.namprd09.prod.outlook.com>
	<CABcwWripDVYcW8ykkVOqKJDJiNxFj6dV_nooeyEVJhY1dWVvew@mail.gmail.com>
	<3129260.Jz0lzkYlD7@p854801>
Date: Fri, 20 Sep 2013 12:20:38 -0700
Message-ID: 
 <CABcwWrgHks0=zu3mWciAkQkX1_YC0ZK_6gDZcSkasHUYiX-Oyw@mail.gmail.com>
Subject: Re: How to make hadoop use all nodes?
From: Omkar Joshi <ojoshi@hortonworks.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=047d7b3441d0d3b26b04e6d59071

--047d7b3441d0d3b26b04e6d59071
Content-Type: text/plain; charset=US-ASCII

Hi,

A few more questions:

"which has 40 containers slots" >> Is that for the total cluster? Please
provide the details below.

For the cluster:
1) yarn-site.xml -> how much resource memory is configured per node?
2) yarn-site.xml -> what is the minimum resource allocation for the cluster?
3) The ResourceManager log with debug output (set "export
YARN_ROOT_LOGGER=DEBUG,RFA" before starting the ResourceManager).
4) On the RM UI, how much total cluster memory is reported, and how many
total nodes? (In the RM UI, click on Cluster.)
5) Which scheduler are you using? Capacity/Fair/FIFO
6) Have you configured any user limits or queue capacities? (Please add
details.)
7) Are you making all requests at the same priority or at different
priorities? (Ideally it will not matter, but I want to know.)

Please let us know all the above details. Thanks.
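
To show why questions 1 and 2 matter, here is a rough sketch of how the
slot count follows from those two settings. It assumes memory is the only
limiting resource and that the scheduler rounds each request up to a
multiple of the minimum allocation. The property names in the comments
(yarn.nodemanager.resource.memory-mb, yarn.scheduler.minimum-allocation-mb)
are my reading of which settings are meant, and the numbers are
hypothetical, picked only to reproduce 20 nodes with 2 slots each.

```python
import math


def container_slots(node_memory_mb, min_allocation_mb, requested_mb, nodes):
    """Estimate container slots, assuming memory is the only limit.

    node_memory_mb    -- per-node memory (assumed: yarn.nodemanager.resource.memory-mb)
    min_allocation_mb -- minimum allocation (assumed: yarn.scheduler.minimum-allocation-mb)
    requested_mb      -- memory the application asks for per container
    nodes             -- number of identical nodes
    """
    # Assumption: each request is rounded up to a multiple of the minimum allocation.
    multiples = max(1, math.ceil(requested_mb / min_allocation_mb))
    granted_mb = multiples * min_allocation_mb
    per_node = node_memory_mb // granted_mb
    return granted_mb, per_node, per_node * nodes


# Hypothetical values, not taken from the cluster discussed in this thread.
granted, per_node, total = container_slots(
    node_memory_mb=4096, min_allocation_mb=1024, requested_mb=1536, nodes=20)
print(granted, per_node, total)  # prints: 2048 2 40
```

If the total computed this way does not match what the RM UI reports, the
configuration is the first place to look.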


Thanks,
Omkar Joshi
*Hortonworks Inc.* <http://www.hortonworks.com>


On Fri, Sep 20, 2013 at 6:55 AM, Antoine Vandecreme <
antoine.vandecreme@nist.gov> wrote:

> Hello Omkar,
>
> Thanks for your reply.
>
> Yes, all 4 points are correct.
> However, my application is requesting, let's say, 100 containers on my
> cluster, which has 40 container slots.
> So I expected to see all container slots used, but that is not the case.
>
> Just in case it matters, it is the only application running on the server.
>
> Thanks,
> Antoine Vandecreme
>
> On Thursday, September 19, 2013 04:49:36 PM Omkar Joshi wrote:
> > Hi,
> >
> > Let me clarify a few things.
> > 1) You are making container requests which are not explicitly looking for
> > certain nodes (no whitelisting).
> > 2) All nodes are identical in terms of resources (memory/cores) and every
> > container requires the same amount of resources.
> > 3) All nodes have capacity to run, say, 2 containers.
> > 4) You have 20 nodes.
> >
> > Now if an application is running and is requesting 20 containers, you
> > cannot assume that they will all land on different nodes (uniformly
> > distributed). It depends more on which node heartbeated to the
> > ResourceManager at what time, how much memory it has available, how
> > many applications are present in the queue, and how much they are
> > requesting at what request priorities. If a node has, say, sufficient
> > memory to run 2 containers, then they will get allocated (the actual
> > allocation is quite complex; I am assuming a very simple "*" request).
> > So you may see a few nodes running 2, a few running 1, and a few with
> > 0 containers.
> >
> > I hope it clarifies your doubt.
> >
> > Thanks,
> > Omkar Joshi
> > *Hortonworks Inc.* <http://www.hortonworks.com>
> >
> >
> > On Thu, Sep 19, 2013 at 7:19 AM, Vandecreme, Antoine <
> > antoine.vandecreme@nist.gov> wrote:
> > >  Hi all,
> > >
> > > I am working with Hadoop 2.0.5 (I plan to migrate to 2.1.0 soon).
> > > When I start a job, I notice that some nodes are not used or are only
> > > partially used.
> > >
> > > For example, if my nodes can hold 2 containers, I notice that some
> > > nodes are not running any or just 1 while others are running 2.
> > > All my nodes are configured the same way.
> > >
> > > Is this expected behavior (maybe in case other jobs are started)?
> > > Is there a configuration to change this behavior?
> > >
> > > Thanks,
> > > Antoine
>
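
The heartbeat-driven allocation described in the quoted message can be
illustrated with a toy simulation. This is only a sketch under stated
assumptions, not how any real scheduler is implemented: nodes heartbeat in
random order, every container is the same size, and each heartbeat is
granted at most per_heartbeat containers. All names and parameters here are
made up for illustration.

```python
import random


def simulate_allocation(nodes=20, slots_per_node=2, requested=20,
                        per_heartbeat=1, seed=0):
    """Toy model: whichever node heartbeats next gets pending containers."""
    rng = random.Random(seed)
    running = [0] * nodes  # containers currently placed on each node
    pending = requested
    # Stop when nothing is pending or every slot in the cluster is taken.
    while pending > 0 and any(r < slots_per_node for r in running):
        node = rng.randrange(nodes)  # assumption: heartbeat order is random
        free = slots_per_node - running[node]
        grant = min(free, per_heartbeat, pending)
        running[node] += grant
        pending -= grant
    return running


running = simulate_allocation()
# Count how many nodes ended up with 0, 1, or 2 containers.
histogram = {k: running.count(k) for k in range(3)}
print(histogram)
```

Nothing in this model tries to spread containers evenly, so with 20
requests on 20 nodes the placement generally comes out uneven. When the
request exceeds the number of slots (for example requested=100), the model
fills every slot, so slots that stay idle in that case point to something
outside this simple picture, such as configuration limits.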


--047d7b3441d0d3b26b04e6d59071--