Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (athena.apache.org: domain of jayamviswanathan@gmail.com
 designates 209.85.160.52 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CADK15wKLwbbBcNO0PcnomHSozVR_JSnh7swUpHXurw0Hi04rww@mail.gmail.com>
References: 
 <CADK15wKfuTZu-gZqZrVrQKsmXUTr6UnrNVWXaUi7OoBxzDA=Ew@mail.gmail.com>
	<f93470da-06f6-4d4b-b404-b2c9978236af@cloudera.org>
	<CADK15wKu-MM1jOWucn=d5prAqYd5Aw+BeV6QYTxJbmi2wph7NQ@mail.gmail.com>
	<CADK15wKLwbbBcNO0PcnomHSozVR_JSnh7swUpHXurw0Hi04rww@mail.gmail.com>
Date: Mon, 14 Oct 2013 14:22:41 +0530
Message-ID: 
 <CADK15wKccX_6xG2s9_1-Cy87C5G5jHm3RcL90isAe0SREgnEjw@mail.gmail.com>
Subject: Re: Hadoop Jobtracker heap size calculation and OOME
From: Viswanathan J <jayamviswanathan@gmail.com>
To: cdh-user@cloudera.org, antwnis@gmail.com
Cc: "user@hadoop.apache.org" <user@hadoop.apache.org>
Content-Type: multipart/alternative; boundary=047d7bd9036c4a86b204e8af9785

--047d7bd9036c4a86b204e8af9785
Content-Type: text/plain; charset=ISO-8859-1

Hi guys,

Appreciate your response.

Thanks,
Viswa.J
On Oct 12, 2013 11:29 PM, "Viswanathan J" <jayamviswanathan@gmail.com>
wrote:

> Hi Guys,
>
> But I can see the jobtracker OOME issue fixed in hadoop - 1.2.1 version as
> per the hadoop release notes as below.
>
> Please check this URL,
>
> https://issues.apache.org/jira/browse/MAPREDUCE-5351
>
> How come the issue still persist? I'm I asking a valid thing.
>
> Do I need to configure anything our I missing anything.
>
> Please help. Appreciate your response.
>
> Thanks,
> Viswa.J
> On Oct 12, 2013 7:57 PM, "Viswanathan J" <jayamviswanathan@gmail.com>
> wrote:
>
>> Thanks Antonio, hope the memory leak issue will be resolved. Its really
>> nightmare every week.
>>
>> In which release this issue will be resolved?
>>
>> How to solve this issue, please help because we are facing in production
>> environment.
>>
>> Please share the configuration and cron to do that cleanup process.
>>
>> Thanks,
>> Viswa
>> On Oct 12, 2013 7:31 PM, "Antonios Chalkiopoulos" <antwnis@gmail.com>
>> wrote:
>>
>>> "After restart the JT, within a week getting OOME."
>>>
>>> Viswa, we were having the same issue in our cluster as well - roughly
>>> every 5-7 days getting OOME.
>>> The heap size of the Job Tracker was constantly increasing due to a
>>> memory leak that will hopefully be fixed in newest releases.
>>>
>>> There is a configuration change in the JobTracker that will disable a
>>> functionality regarding cleaning up staging files i.e.
>>> /user/build/.staging/* - but that means that you will have to handle the
>>> staging files through a cron / jenkins task
>>>
>>> I'll get you the configuration on Monday..
>>>
>>> On Friday, 11 October 2013 18:08:55 UTC+1, Viswanathan J wrote:
>>>>
>>>> Hi,
>>>>
>>>> I'm running a 14 nodes of Hadoop cluster with datanodes,tasktrackers
>>>> running in all nodes.
>>>>
>>>> *Apache Hadoop :* 1.2.1
>>>>
>>>> It shows the heap size currently as follows:
>>>>
>>>> *Cluster Summary (Heap Size is 5.7/8.89 GB)*
>>>> *
>>>> *
>>>> In the above summary what is the *8.89* GB defines? Is the *8.89*defines maximum heap size for Jobtracker, if yes how it has
>>>> been calculated.
>>>>
>>>> Hope *5.7* is currently running jobs heap-size, how it is calculated.
>>>>
>>>> Have set the jobtracker default memory size in hadoop-env.sh
>>>>
>>>> *HADOOP_HEAPSIZE="1024"*
>>>> *
>>>> *
>>>> Have set the mapred.child.java.opts value in mapred-site.xml as,
>>>>
>>>>  <property>
>>>>   <name>mapred.child.java.opts</**name>
>>>>   <value>-Xmx2048m</value>
>>>>  </property>
>>>>
>>>> Even after setting the above property, getting Jobtracker OOME issue.
>>>> How the jobtracker memory gradually increasing. After restart the JT,
>>>> within a week getting OOME.
>>>>
>>>> How to resolve this, it is in production and critical? Please help.
>>>> Thanks in advance.
>>>>
>>>> --
>>>> Regards,
>>>> Viswa.J
>>>>
>>>  --
>>>
>>> ---
>>> You received this message because you are subscribed to the Google
>>> Groups "CDH Users" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to cdh-user+unsubscribe@cloudera.org.
>>> For more options, visit
>>> https://groups.google.com/a/cloudera.org/groups/opt_out.
>>>
>>

--047d7bd9036c4a86b204e8af9785
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<p>Hi guys,</p>
<p>Appreciate your response.</p>
<p>Thanks,<br>
Viswa.J</p>
<div class=3D"gmail_quote">On Oct 12, 2013 11:29 PM, &quot;Viswanathan J&qu=
ot; &lt;<a href=3D"mailto:jayamviswanathan@gmail.com">jayamviswanathan@gmai=
l.com</a>&gt; wrote:<br type=3D"attribution"><blockquote class=3D"gmail_quo=
te" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"=
>
<p>Hi Guys,</p>
<p>But I can see the jobtracker OOME issue fixed in hadoop - 1.2.1 version =
as per the hadoop release notes as below.</p>
<p>Please check this URL,</p>
<p><a href=3D"https://issues.apache.org/jira/browse/MAPREDUCE-5351" target=
=3D"_blank">https://issues.apache.org/jira/browse/MAPREDUCE-5351</a> </p>
<p>How come the issue still persist? I&#39;m I asking a valid thing.</p>
<p>Do I need to configure anything our I missing anything.</p>
<p>Please help. Appreciate your response.</p>
<p>Thanks,<br>
Viswa.J</p>
<div class=3D"gmail_quote">On Oct 12, 2013 7:57 PM, &quot;Viswanathan J&quo=
t; &lt;<a href=3D"mailto:jayamviswanathan@gmail.com" target=3D"_blank">jaya=
mviswanathan@gmail.com</a>&gt; wrote:<br type=3D"attribution"><blockquote c=
lass=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;=
padding-left:1ex">

<p>Thanks Antonio, hope the memory leak issue will be resolved. Its really =
nightmare every week.</p>
<p>In which release this issue will be resolved?</p>
<p>How to solve this issue, please help because we are facing in production=
 environment.</p>
<p>Please share the configuration and cron to do that cleanup process.</p>
<p>Thanks,<br>
Viswa</p>
<div class=3D"gmail_quote">On Oct 12, 2013 7:31 PM, &quot;Antonios Chalkiop=
oulos&quot; &lt;<a href=3D"mailto:antwnis@gmail.com" target=3D"_blank">antw=
nis@gmail.com</a>&gt; wrote:<br type=3D"attribution"><blockquote class=3D"g=
mail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-l=
eft:1ex">


<div dir=3D"ltr">&quot;After restart the JT, within a week getting OOME.&qu=
ot;<div><br></div><div>Viswa, we were having the same issue in our cluster =
as well - roughly every 5-7 days getting OOME.</div><div>The heap size of t=
he=A0Job Tracker=A0was constantly increasing due to a memory leak that will=
 hopefully be fixed in newest releases.</div>


<div><br></div><div>There is a configuration change in the JobTracker that =
will disable a functionality regarding cleaning up staging files i.e.</div>=
<div>/user/build/.staging/* - but that means that you will have to handle t=
he staging files through a cron / jenkins task</div>


<div><br></div><div>I&#39;ll get you the configuration on Monday..<br><br>O=
n Friday, 11 October 2013 18:08:55 UTC+1, Viswanathan J  wrote:<blockquote =
class=3D"gmail_quote" style=3D"margin:0;margin-left:0.8ex;border-left:1px #=
ccc solid;padding-left:1ex">


<div dir=3D"ltr"><span style=3D"font-family:arial,sans-serif;font-size:12.7=
27272033691406px">Hi,</span><div><br><div style=3D"font-family:arial,sans-s=
erif;font-size:12.727272033691406px">I&#39;m running a 14 nodes of Hadoop c=
luster with datanodes,tasktrackers running in all nodes.<br>


</div><div style=3D"font-family:arial,sans-serif;font-size:12.7272720336914=
06px"><br></div><div style=3D"font-family:arial,sans-serif;font-size:12.727=
272033691406px"><b>Apache Hadoop :</b> 1.2.1</div><div style=3D"font-family=
:arial,sans-serif;font-size:12.727272033691406px">


<br></div><div style=3D"font-family:arial,sans-serif;font-size:12.727272033=
691406px">It shows the heap size currently as follows:</div><div style=3D"f=
ont-family:arial,sans-serif;font-size:12.727272033691406px"><br></div><div>


<font face=3D"arial, sans-serif"><b>Cluster Summary (Heap Size is 5.7/8.89 =
GB)</b></font><br></div><div><font face=3D"arial, sans-serif"><b><br></b></=
font></div><div><font face=3D"arial, sans-serif">In the above summary what =
is the <b>8.89</b> GB defines? Is the <b>8.89</b> defines maximum heap size=
 for Jobtracker, if yes how it has been=A0calculated.=A0</font></div>


<div><font face=3D"arial, sans-serif"><br></font></div><div><font face=3D"a=
rial, sans-serif">Hope <b>5.7</b> is currently running jobs heap-size, how =
it is=A0calculated.</font></div><div style=3D"font-family:arial,sans-serif;=
font-size:12.727272033691406px">


<br></div><div style=3D"font-family:arial,sans-serif;font-size:12.727272033=
691406px">Have set the jobtracker default memory size in hadoop-env.sh</div=
><div style=3D"font-family:arial,sans-serif;font-size:12.727272033691406px"=
>


<br></div><div style=3D"font-family:arial,sans-serif;font-size:12.727272033=
691406px"><b>HADOOP_HEAPSIZE=3D&quot;1024&quot;</b><br></div><div style=3D"=
font-family:arial,sans-serif;font-size:12.727272033691406px"><b><br></b></d=
iv>


<div style=3D"font-family:arial,sans-serif;font-size:12.727272033691406px">=
Have set the mapred.child.java.opts value in mapred-site.xml as,</div><div =
style=3D"font-family:arial,sans-serif;font-size:12.727272033691406px"><br>


</div>
<div style=3D"font-family:arial,sans-serif;font-size:12.727272033691406px">=
<div>&lt;property&gt;</div><div>=A0 &lt;name&gt;mapred.child.java.opts&lt;/=
<u></u>name&gt;</div><div>=A0 &lt;value&gt;-Xmx2048m&lt;/value&gt;</div></d=
iv>


<div style=3D"font-family:arial,sans-serif;font-size:12.727272033691406px">
&lt;/property&gt;<br></div></div><div><br></div><div>Even after setting the=
 above property, getting Jobtracker OOME issue. How the jobtracker memory g=
radually increasing. After restart the JT, within a week getting OOME.</div=
>


<div><br></div><div>How to resolve this, it is in production and critical? =
Please help. Thanks in advance.</div><div><br></div>-- <br>Regards,<br>Visw=
a.J
</div>
</blockquote></div></div>

<p></p>

-- <br>
=A0<br>
--- <br>
You received this message because you are subscribed to the Google Groups &=
quot;CDH Users&quot; group.<br>
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:cdh-user%2Bunsubscribe@cloudera.org" target=3D"_b=
lank">cdh-user+unsubscribe@cloudera.org</a>.<br>
For more options, visit <a href=3D"https://groups.google.com/a/cloudera.org=
/groups/opt_out" target=3D"_blank">https://groups.google.com/a/cloudera.org=
/groups/opt_out</a>.<br>
</blockquote></div>
</blockquote></div>
</blockquote></div>

--047d7bd9036c4a86b204e8af9785--