Subject: Re: Distributing the code to multiple nodes
From: Ashish Jain <ashjain2@gmail.com>
To: user@hadoop.apache.org
Date: Fri, 10 Jan 2014 12:58:15 +0530

Thanks for all these suggestions. Somehow I do not have access to the servers today; I will try the suggestions on Monday and let you know how it goes.

--Ashish

On Thu, Jan 9, 2014 at 7:53 PM, German Florez-Larrahondo <german.fl@samsung.com> wrote:

> Ashish,
>
> Could this be related to the scheduler you are using and its settings?
>
> In lab environments, when running a single type of job, I often use the FairScheduler (the YARN default in 2.2.0 is the CapacityScheduler) and it does a good job of distributing the load.
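> As a quick check (a minimal sketch; <resourcemanager-host> is a placeholder, and this assumes the ResourceManager web services are reachable on the default port 8088), you can confirm which scheduler is actually active before and after the change:
>
>   curl http://<resourcemanager-host>:8088/ws/v1/cluster/scheduler
>
> The scheduler page of the ResourceManager web UI (http://<resourcemanager-host>:8088/cluster/scheduler) shows the same information.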
> You could give that a try (https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/FairScheduler.html).
>
> I think just changing yarn-site.xml as follows could demonstrate this theory (note that how the jobs are scheduled depends on resources such as memory on the nodes, and you would need to set up yarn-site.xml accordingly):
>
>   <property>
>     <name>yarn.resourcemanager.scheduler.class</name>
>     <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler</value>
>   </property>
>
> Regards
> ./g
>
> From: Ashish Jain [mailto:ashjain2@gmail.com]
> Sent: Thursday, January 09, 2014 6:46 AM
> To: user@hadoop.apache.org
> Subject: Re: Distributing the code to multiple nodes
>
> Another point to add here: 10.12.11.210 is the host which has everything running, including a slave datanode. The data was also distributed to this host, as well as the jar file. The following are running on 10.12.11.210:
>
> 7966 DataNode
> 8480 NodeManager
> 8353 ResourceManager
> 8141 SecondaryNameNode
> 7834 NameNode
>
> On Thu, Jan 9, 2014 at 6:12 PM, Ashish Jain wrote:
>
> The logs were updated only when I copied the data. After copying the data there have been no updates to the log files.
>
> On Thu, Jan 9, 2014 at 5:08 PM, Chris Mawata wrote:
>
> Do the logs on the three nodes contain anything interesting?
> Chris
>
> On Jan 9, 2014 3:47 AM, "Ashish Jain" wrote:
>
> Here is the block info for the file I distributed. As can be seen, only 10.12.11.210 has all the data, and this is the node which is serving all the requests. The second replicas are spread across 209 and 211:
>
> 1073741857: 10.12.11.210:50010    10.12.11.209:50010
> 1073741858: 10.12.11.210:50010    10.12.11.211:50010
> 1073741859: 10.12.11.210:50010    10.12.11.209:50010
> 1073741860: 10.12.11.210:50010    10.12.11.211:50010
> 1073741861: 10.12.11.210:50010    10.12.11.209:50010
> 1073741862: 10.12.11.210:50010    10.12.11.209:50010
> 1073741863: 10.12.11.210:50010    10.12.11.209:50010
> 1073741864: 10.12.11.210:50010    10.12.11.209:50010
>
> --Ashish
>
> On Thu, Jan 9, 2014 at 2:11 PM, Ashish Jain wrote:
>
> Hello Chris,
>
> I now have a cluster with 3 nodes and a replication factor of 2. When I distribute a file, I can see that replicas of the data are available on the other nodes. However, when I run a map reduce job, again only one node serves all the requests :(. Can you or anyone please provide some more input?
>
> Thanks
> Ashish
>
> On Wed, Jan 8, 2014 at 7:16 PM, Chris Mawata wrote:
>
> 2 nodes and a replication factor of 2 result in a replica of each block being present on each node. This allows the possibility that a single node does all the work and is still data local. It will probably happen if that single node has the needed capacity. More nodes than the replication factor are needed to force distribution of the processing.
> Chris
>
> On Jan 8, 2014 7:35 AM, "Ashish Jain" wrote:
>
> Guys,
>
> I am sure that only one node is being used. I just now ran the job again and could see that CPU usage goes up only on one server while the other server's CPU usage remains constant, which means the other node is not being used. Can someone help me debug this issue?
>
> ++Ashish
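> To see whether the second NodeManager is actually being handed containers (a rough sketch, assuming the stock 2.2.0 yarn CLI is on the path), you can compare the per-node container counts while the job is running:
>
>   yarn node -list
>   yarn application -list
>
> yarn node -list prints one line per NodeManager, including a running-container count; if that count stays at 0 on the other nodes while the job executes, all containers are landing on 10.12.11.210. The ResourceManager web UI on port 8088 shows the same breakdown per application.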
> On Wed, Jan 8, 2014 at 5:04 PM, Ashish Jain wrote:
>
> Hello All,
>
> I have a 2 node Hadoop cluster running with a replication factor of 2. I have a file of around 1 GB which, when copied to HDFS, is replicated to both nodes. Looking at the block info I can see the file has been subdivided into 8 blocks of 128 MB each. I use this file as input to run the word count program. Somehow I feel only one node is doing all the work and the code is not distributed to the other node. How can I make sure the code is distributed to both nodes? Also, is there a log or GUI which can be used to verify this?
>
> Please note I am using the latest stable release, that is 2.2.0.
>
> ++Ashish
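> For reference, a minimal way to check both things from the command line (a sketch; /input/file.txt is a placeholder path, and the examples jar location assumes the standard 2.2.0 tarball layout):
>
>   hdfs fsck /input/file.txt -files -blocks -locations
>   hadoop jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.2.0.jar wordcount /input/file.txt /output
>
> fsck lists the datanodes holding a replica of every block, and the ResourceManager web UI on port 8088 shows which nodes the map containers of the wordcount job were started on.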