Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of write2kishore@gmail.com
 designates 209.85.214.178 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CABcwWrjJtNsQk1N7dFueo0j=2g8D6dHcVS4uf67ewXdve+Xzcw@mail.gmail.com>
References: 
 <CAHg+sbPPJH8ZBUCXDJvqEvyX2FGxzZPMXxoa0RLOkFK-bb4+ig@mail.gmail.com>
	<CAHg+sbPESisaA_a-h2c+2fSsTvENPqHU7uyR5+FdvAbZ9yMM7w@mail.gmail.com>
	<CABcwWrjJtNsQk1N7dFueo0j=2g8D6dHcVS4uf67ewXdve+Xzcw@mail.gmail.com>
Date: Fri, 2 Aug 2013 19:22:25 +0530
Message-ID: 
 <CAHg+sbP-4xCPsa4j=_pqAakdtBYg9S4J+jXgizJpSvRo97CTDw@mail.gmail.com>
Subject: Re: Extra start-up overhead with hadoop-2.1.0-beta
From: Krishna Kishore Bonagiri <write2kishore@gmail.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=bcaec51018f7c4fbea04e2f74485

--bcaec51018f7c4fbea04e2f74485
Content-Type: text/plain; charset=ISO-8859-1

Hi Omkar,

  I got these numbers by running a simple C program on the containers
that fetches the timestamp in microseconds and exits. The times I mentioned
are the lowest and highest I observed. They do not vary much within a
version, but there is a large difference (about a second) between the two
versions, 2.0.4-alpha and 2.1.0-beta, as I mentioned.

  I am using a single-node cluster, and there is absolutely no other
load on the machine/node. My single-node cluster is used only for my own
development work and testing.

  I am not aware of what resource localization is, and I am not doing
anything special for it.

  Please let me know if you need any other info.

Thanks,
Kishore


On Thu, Aug 1, 2013 at 11:20 PM, Omkar Joshi <ojoshi@hortonworks.com> wrote:

> How are you making these measurements? Can you elaborate? Is it on a
> best-case, average, or worst-case basis? How many resources are you
> sending for localization? Were the sizes and number of these resources
> consistent across tests? Were these resources public, private, or
> application-specific? Apart from this, is the other load on the node
> manager the same? Is the load on HDFS the same? Did you see any network
> bottleneck?
>
> More information will help a lot.
>
>
> Thanks,
> Omkar Joshi
> *Hortonworks Inc.* <http://www.hortonworks.com>
>
>
> On Thu, Aug 1, 2013 at 2:19 AM, Krishna Kishore Bonagiri <
> write2kishore@gmail.com> wrote:
>
>> Hi,
>>   Please let me know if anyone has an answer or clues to my
>> question regarding the start-up performance.
>>
>> Also, one more thing I observed today is that the time taken to run a
>> command on a container went up by more than a second in this latest version.
>>
>> With 2.0.4-alpha, it used to take 0.3 to 0.5 seconds from the point
>> I call startContainer() to the point the command is started on the
>> container,
>>
>> whereas
>>
>> with 2.1.0-beta, it takes around 1.5 seconds from the point the
>> onContainerStarted() callback is invoked to the point the command is seen
>> running on the container.
>>
>> Thanks,
>> Kishore
>>
>>
>> On Thu, Jul 25, 2013 at 8:38 PM, Krishna Kishore Bonagiri <
>> write2kishore@gmail.com> wrote:
>>
>>> Hi,
>>>
>>>   I have been using the hadoop-2.1.0-beta release candidate and observed
>>> that it is slower in running my simple application, which runs on 2
>>> containers. I tried to find out which parts have this extra overhead
>>> (compared to hadoop-2.0.4-alpha), and here is what I found:
>>>
>>> 1) From the point my Client submits the Application Master to the RM,
>>> it takes 2 seconds extra
>>> 2) From the point my container requests are set up by the Application
>>> Master until the containers are allocated, it takes 2 seconds extra
>>>
>>> Is this overhead expected with the changes that went into the new
>>> version? Or is there a way to improve it by changing something in the
>>> configuration?
>>>
>>> Thanks,
>>> Kishore
>>>
>>
>>
>

--bcaec51018f7c4fbea04e2f74485--