Subject: Re: map container is assigned default memory size rather than user configured which will cause TaskAttempt failure
From: Manu Zhang
To: user@hadoop.apache.org
Date: Fri, 25 Oct 2013 08:15:12 +0800

My mapreduce.map.java.opts is 1024MB

Thanks,
Manu

On Thu, Oct 24, 2013 at 3:11 PM, Tsuyoshi OZAWA wrote:
> Hi,
>
> How about checking the value of mapreduce.map.java.opts? Are your JVMs
> launched with the assumed heap memory?
>
> On Thu, Oct 24, 2013 at 11:31 AM, Manu Zhang wrote:
> > Just confirmed the problem still exists even though the "mapred-site.xml"s
> > on all nodes have the same configuration (mapreduce.map.memory.mb = 2560).
> >
> > Any more thoughts?
> >
> > Thanks,
> > Manu
> >
> > On Thu, Oct 24, 2013 at 8:59 AM, Manu Zhang wrote:
> >> Thanks, Ravi.
> >>
> >> I do have mapred-site.xml under /etc/hadoop/conf/ on those nodes, but it
> >> seems odd that they would read configuration from those mapred-site.xml
> >> files, since it's the client that applies for the resources. I have
> >> another mapred-site.xml in the directory where I run my job, and I assume
> >> my job reads its conf from that one. Please correct me if I am mistaken.
> >>
> >> Also, it's not always the same nodes. The number of failures is random,
> >> too.
> >>
> >> Anyway, I will put my settings in all the nodes' mapred-site.xml and see
> >> if the problem goes away.
> >>
> >> Manu
> >>
> >> On Thu, Oct 24, 2013 at 1:40 AM, Ravi Prakash wrote:
> >>> Manu!
> >>>
> >>> This should not be the case. All tasks should have the configuration
> >>> values you specified propagated to them. Are you sure your setup is
> >>> correct? Are they always the same nodes which run with 1024MB? Perhaps
> >>> you have mapred-site.xml on those nodes?
> >>>
> >>> HTH,
> >>> Ravi
> >>>
> >>> On Tuesday, October 22, 2013 9:09 PM, Manu Zhang wrote:
> >>> Hi,
> >>>
> >>> I've been running TeraSort on Hadoop 2.0.4.
> >>>
> >>> Every time, a small number of maps (4 or 5) fail because the container
> >>> runs beyond its virtual memory limit.
> >>>
> >>> I've set mapreduce.map.memory.mb to a safe value (2560MB), so most
> >>> TaskAttempts go fine, while the failed maps run with the default
> >>> 1024MB.
> >>>
> >>> My question is thus: why are a small number of containers' memory
> >>> limits set to the default rather than the user-configured value?
> >>>
> >>> Any thoughts?
> >>>
> >>> Thanks,
> >>> Manu Zhang
>
> --
> - Tsuyoshi
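
[Archive note] For readers hitting the same symptom, a minimal sketch of the mapred-site.xml entries under discussion. The 2560 value comes from this thread; the -Xmx heap size is an illustrative assumption and must stay below the container limit, or YARN may kill the container for exceeding memory:

```xml
<!-- mapred-site.xml: illustrative fragment, not taken verbatim from the thread -->
<configuration>
  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>2560</value> <!-- container size requested for each map task -->
  </property>
  <property>
    <name>mapreduce.map.java.opts</name>
    <value>-Xmx2048m</value> <!-- JVM heap; kept below the 2560MB container limit -->
  </property>
</configuration>
```

Note that mapreduce.map.java.opts is a JVM option string, not a plain megabyte value, which may explain confusion about "1024MB" above.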
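
[Archive note] Since the debugging step suggested in this thread is verifying which value of mapreduce.map.memory.mb each node's mapred-site.xml actually carries, here is a small sketch of how one might extract a property from a Hadoop-style configuration file so the values can be compared across nodes. File layout follows Hadoop's standard `<configuration>/<property>/<name>/<value>` schema; the sample content is illustrative:

```python
# Sketch: pull one property out of a Hadoop-style *-site.xml so values
# can be diffed across cluster nodes (sample XML is illustrative).
import xml.etree.ElementTree as ET

def get_hadoop_prop(xml_text, name):
    """Return the <value> for the given <name>, or None if unset."""
    root = ET.fromstring(xml_text)
    for prop in root.findall("property"):
        if prop.findtext("name") == name:
            return prop.findtext("value")
    return None  # unset: Hadoop falls back to its compiled-in default

sample = """<configuration>
  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>2560</value>
  </property>
</configuration>"""

print(get_hadoop_prop(sample, "mapreduce.map.memory.mb"))  # 2560
```

In practice one would run this over /etc/hadoop/conf/mapred-site.xml on every node (e.g. via ssh) and flag any node whose value differs from the expected 2560.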