Subject: Re: How to troubleshoot OutOfMemoryError
From: 周梦想 (Andy) <ablozhou@gmail.com>
To: user@hadoop.apache.org
Date: Mon, 24 Dec 2012 11:30:10 +0800

I encountered the OOM problem because I had not set the ulimit on open files.
It had nothing to do with memory; memory was sufficient.

Best Regards,
Andy

2012/12/22 Manoj Babu <manoj444@gmail.com> wrote:

David,

I faced the same issue due to too much logging filling up the task tracker's
log folder.

Cheers!
Manoj.

On Sat, Dec 22, 2012 at 9:10 PM, Stephen Fritz <stephenf@cloudera.com> wrote:

Troubleshooting OOMs in map/reduce tasks can be tricky; see page 118 of
Hadoop Operations for a couple of settings that can affect the frequency of
OOMs and aren't necessarily intuitive.

To answer your question about getting the heap dump: you should be able to
add "-XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/some/path" to your
mapred.child.java.opts, then look for the heap dump in that path the next
time you see the OOM.
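A minimal sketch of wiring those flags into a Hadoop 1.x job via the old
mapred API; the -Xmx value, the dump directory, and the class name here are
placeholders, not values from this thread:

import org.apache.hadoop.mapred.JobConf;

public class HeapDumpOpts {
    public static void main(String[] args) {
        JobConf conf = new JobConf();
        // Ask every child task JVM to write a heap dump when it throws
        // OutOfMemoryError. The -Xmx value and the dump directory are
        // placeholders -- adjust them for your cluster.
        conf.set("mapred.child.java.opts",
                 "-Xmx1024m -XX:+HeapDumpOnOutOfMemoryError"
                 + " -XX:HeapDumpPath=/tmp/heapdumps");
        // ...configure mapper, reducer, input/output paths, then submit
        // the job with JobClient.runJob(conf).
    }
}

The same flags can also be set cluster-wide as the default value of
mapred.child.java.opts in mapred-site.xml.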
On Fri, Dec 21, 2012 at 11:33 PM, David Parks <davidparks21@yahoo.com> wrote:

I'm pretty consistently seeing a few reduce tasks fail with OutOfMemoryError
(below). It doesn't kill the job, but it slows it down.

In my current case the reducer is pretty darn simple; the algorithm basically
does:

1. Do you have 2 values for this key?
2. If so, build a JSON string and emit a NullWritable and Text value.

The string buffer I use to build the JSON is re-used, and I can't see
anywhere in my code that would be taking more than ~50 KB of memory at any
point in time.

But I want to verify: is there a way to get the heap dump and all after this
error? I'm running Hadoop v1.0.3 on AWS MapReduce.

Error: java.lang.OutOfMemoryError: Java heap space
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.shuffleInMemory(ReduceTask.java:1711)
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getMapOutput(ReduceTask.java:1571)
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.copyOutput(ReduceTask.java:1412)
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.run(ReduceTask.java:1344)

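For reference, a purely hypothetical sketch of a reducer with the shape David
describes, written against the old mapred API that matches the stack trace
above; the class name, value types, and JSON layout are invented for
illustration:

import java.io.IOException;
import java.util.Iterator;

import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;

// Hypothetical reconstruction: emit one JSON record per key that arrived
// with exactly two values, reusing a single StringBuilder the way the
// "re-used string buffer" in the mail suggests.
public class PairToJsonReducer extends MapReduceBase
        implements Reducer<Text, Text, NullWritable, Text> {

    private final StringBuilder json = new StringBuilder();
    private final Text out = new Text();

    public void reduce(Text key, Iterator<Text> values,
                       OutputCollector<NullWritable, Text> output,
                       Reporter reporter) throws IOException {
        String first = values.hasNext() ? values.next().toString() : null;
        String second = values.hasNext() ? values.next().toString() : null;
        if (first == null || second == null || values.hasNext()) {
            return; // not exactly two values for this key: emit nothing
        }
        json.setLength(0); // reuse the buffer instead of allocating per key
        json.append("{\"key\":\"").append(key.toString())
            .append("\",\"first\":\"").append(first)
            .append("\",\"second\":\"").append(second)
            .append("\"}");
        out.set(json.toString());
        output.collect(NullWritable.get(), out);
    }
}

Worth noting: the stack trace above is in the shuffle copy phase
(MapOutputCopier.shuffleInMemory), not in user reduce code, which fits the
shuffle-related settings Stephen refers to rather than anything in a reducer
this small.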