Subject: Re: Distributing the code to multiple nodes
From: Chris Mawata <chris.mawata@gmail.com>
To: user@hadoop.apache.org
Date: Thu, 9 Jan 2014 08:57:42 -0500

...And do all three nodes appear in the NameNode and YARN web user interfaces?
Chris
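A quick way to cross-check what the two web UIs show is from the shell of the master host. This is only a rough sketch, assuming the 2.2.0 hdfs and yarn launchers are on the PATH:

    # Live DataNodes registered with the NameNode -- all three hosts
    # should be listed with a non-zero configured capacity.
    hdfs dfsadmin -report

    # NodeManagers registered with the ResourceManager -- all three
    # hosts should be listed in the RUNNING state.
    yarn node -list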
On Jan 9, 2014 7:46 AM, "Ashish Jain" <ashjain2@gmail.com> wrote:

> Another point to add here: 10.12.11.210 is the host which has everything
> running, including a slave datanode. Data was also distributed to this host,
> as well as the jar file. The following are running on 10.12.11.210:
>
> 7966 DataNode
> 8480 NodeManager
> 8353 ResourceManager
> 8141 SecondaryNameNode
> 7834 NameNode
>
> On Thu, Jan 9, 2014 at 6:12 PM, Ashish Jain <ashjain2@gmail.com> wrote:
>
>> The logs were updated only when I copied the data. After copying the data
>> there have been no updates to the log files.
>>
>> On Thu, Jan 9, 2014 at 5:08 PM, Chris Mawata <chris.mawata@gmail.com> wrote:
>>
>>> Do the logs on the three nodes contain anything interesting?
>>> Chris
>>> On Jan 9, 2014 3:47 AM, "Ashish Jain" <ashjain2@gmail.com> wrote:
>>>
>>>> Here is the block info for the record I distributed. As can be seen,
>>>> only 10.12.11.210 has all the data, and this is the node which is serving
>>>> all the requests. Replicas are also available on 209 and 211.
>>>>
>>>> 1073741857: 10.12.11.210:50010    10.12.11.209:50010
>>>> 1073741858: 10.12.11.210:50010    10.12.11.211:50010
>>>> 1073741859: 10.12.11.210:50010    10.12.11.209:50010
>>>> 1073741860: 10.12.11.210:50010    10.12.11.211:50010
>>>> 1073741861: 10.12.11.210:50010    10.12.11.209:50010
>>>> 1073741862: 10.12.11.210:50010    10.12.11.209:50010
>>>> 1073741863: 10.12.11.210:50010    10.12.11.209:50010
>>>> 1073741864: 10.12.11.210:50010    10.12.11.209:50010
>>>>
>>>> --Ashish
>>>>
>>>> On Thu, Jan 9, 2014 at 2:11 PM, Ashish Jain <ashjain2@gmail.com> wrote:
>>>>
>>>>> Hello Chris,
>>>>>
>>>>> I now have a cluster with 3 nodes and a replication factor of 2. When
>>>>> I distribute a file I can see that replicas of the data are available on
>>>>> the other nodes. However, when I run a map reduce job, again only one node
>>>>> is serving all the requests :(. Can you or anyone please provide some more
>>>>> inputs?
>>>>>
>>>>> Thanks
>>>>> Ashish
>>>>>
>>>>> On Wed, Jan 8, 2014 at 7:16 PM, Chris Mawata <chris.mawata@gmail.com> wrote:
>>>>>
>>>>>> 2 nodes and a replication factor of 2 results in a replica of each
>>>>>> block being present on each node. This allows the possibility that a single
>>>>>> node does all the work and is still data-local. It will probably happen if
>>>>>> that single node has the needed capacity. More nodes than the replication
>>>>>> factor are needed to force distribution of the processing.
>>>>>> Chris
>>>>>> On Jan 8, 2014 7:35 AM, "Ashish Jain" <ashjain2@gmail.com> wrote:
>>>>>>
>>>>>>> Guys,
>>>>>>>
>>>>>>> I am sure that only one node is being used. I just now ran the job
>>>>>>> again and could see that CPU usage goes high on only one server, while the
>>>>>>> other server's CPU usage remains constant, which means the other node is
>>>>>>> not being used. Can someone help me debug this issue?
>>>>>>>
>>>>>>> ++Ashish
>>>>>>>
>>>>>>> On Wed, Jan 8, 2014 at 5:04 PM, Ashish Jain <ashjain2@gmail.com> wrote:
>>>>>>>
>>>>>>>> Hello All,
>>>>>>>>
>>>>>>>> I have a 2-node hadoop cluster running with a replication factor of
>>>>>>>> 2. I have a file of size around 1 GB which, when copied to HDFS, is
>>>>>>>> replicated to both the nodes. Looking at the block info I can see the file
>>>>>>>> has been subdivided into 8 blocks, each of size 128 MB. I use this file as
>>>>>>>> input to run the word count program. Somehow I feel only one node is doing
>>>>>>>> all the work and the code is not distributed to the other node. How can I
>>>>>>>> make sure the code is distributed to both the nodes? Also, is there a log
>>>>>>>> or GUI which can be used for this?
>>>>>>>> Please note I am using the latest stable release, that is 2.2.0.
>>>>>>>>
>>>>>>>> ++Ashish
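Related to the block listing quoted above: block placement can also be confirmed from the command line, and rerunning the stock word-count example shows how many map tasks get scheduled and where. This is only a rough sketch; the input path and the examples-jar location below are placeholders for a default 2.2.0 tarball install, not the actual paths from this thread:

    # For each block of the file, show which DataNodes hold a replica.
    hdfs fsck /user/ashish/input.txt -files -blocks -locations

    # Rerun the stock word-count example. With 8 input blocks the job should
    # launch 8 map tasks, which YARN may place on any NodeManager with free
    # capacity, preferring (but not requiring) nodes that hold a replica.
    hadoop jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.2.0.jar \
        wordcount /user/ashish/input.txt /user/ashish/wordcount-out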