Subject: Re: NameNode low on available disk space
From: Harsh J <harsh@cloudera.com>
Date: Wed, 23 Jan 2013 22:28:35 +0530
To: Mohit Vadhera <project.linux.proj@gmail.com>
Cc: user@hadoop.apache.org

The logs display it in plain bytes. If the issue begins to occur once you start using Hadoop, then it is almost certainly MR using up the disk space temporarily.

You could lower the threshold, or you could perhaps use a bigger disk for your trials/more nodes.

On Wed, Jan 23, 2013 at 10:25 PM, Mohit Vadhera <project.linux.proj@gmail.com> wrote:
> MR operations are running on the same machine. I checked for the parameter
> "mapred.local.dir" under my install directory /etc/hadoop/ but didn't find
> it. One question: is the reserved disk space size displayed in the logs in
> KB or MB? I am a layman on Hadoop. The link I followed to install is given
> below:
>
> https://ccp.cloudera.com/display/CDH4DOC/Installing+CDH4+on+a+Single+Linux+Node+in+Pseudo-distributed+Mode
>
> Thanks,
>
> On Wed, Jan 23, 2013 at 10:12 PM, Harsh J <harsh@cloudera.com> wrote:
>> A random switching behavior can only be explained by fluctuating disk
>> space, I'd think. Are you running MR operations on the same disk (i.e.
>> is it part of mapred.local.dir as well)?
>>
>> On Wed, Jan 23, 2013 at 9:24 PM, Mohit Vadhera <project.linux.proj@gmail.com> wrote:
>>> The NN switches into safe mode randomly, and then I run a command to
>>> leave safe mode manually. I never got alerts for low disk space at the
>>> machine level, and I didn't see the space fluctuate from GBs into MBs.
>>>
>>> On Wed, Jan 23, 2013 at 9:10 PM, Harsh J <harsh@cloudera.com> wrote:
>>>> Mohit,
>>>>
>>>> When specifically do you get the error at the NN? Does your NN
>>>> consistently fail to start with that error?
>>>>
>>>> Your local disk space availability can certainly fluctuate if you use
>>>> the same disk for MR and other activity which creates temporary files.
>>>>
>>>> On Wed, Jan 23, 2013 at 9:01 PM, Mohit Vadhera <project.linux.proj@gmail.com> wrote:
>>>>> Can somebody answer me on this, please?
>>>>>
>>>>> On Wed, Jan 23, 2013 at 11:44 AM, Mohit Vadhera <project.linux.proj@gmail.com> wrote:
>>>>>> Thanks, guys. As you said, the level is already pretty low, i.e.
>>>>>> 100 MB, but in my case the root fs / has 14 GB available. What can
>>>>>> the root cause be then?
>>>>>>
>>>>>> /dev/mapper/vg_operamast1-lv_root
>>>>>>                        50G   33G   14G  71% /
>>>>>>
>>>>>> As per the logs:
>>>>>>
>>>>>> 2013-01-21 01:22:52,217 WARN org.apache.hadoop.hdfs.server.namenode.NameNodeResourceChecker: Space available on volume '/dev/mapper/vg_operamast1-lv_root' is 10653696, which is below the configured reserved amount 104857600
>>>>>>
>>>>>> On Wed, Jan 23, 2013 at 11:13 AM, Harsh J <harsh@cloudera.com> wrote:
>>>>>>> Hi again,
>>>>>>>
>>>>>>> Yes, you need to add it to hdfs-site.xml and restart the NN.
>>>>>>>
>>>>>>> > Thanks Harsh. Do I need to add parameters in hdfs-site.xml and
>>>>>>> > restart the namenode service?
>>>>>>> > +  public static final String DFS_NAMENODE_DU_RESERVED_KEY = "dfs.namenode.resource.du.reserved";
>>>>>>> > +  public static final long   DFS_NAMENODE_DU_RESERVED_DEFAULT = 1024 * 1024 * 100; // 100 MB
>>>>>>>
>>>>>>> On Wed, Jan 23, 2013 at 10:12 AM, Harsh J <harsh@cloudera.com> wrote:
>>>>>>>> Edit your hdfs-site.xml (or wherever your NN's config lives) to
>>>>>>>> lower the value of the property
>>>>>>>> "dfs.namenode.resource.du.reserved". Create the property if it
>>>>>>>> does not exist, and set the value to a suitable level. The default
>>>>>>>> itself is pretty low: 100 MB, expressed in bytes.
>>>>>>>>
>>>>>>>> On Wed, Jan 23, 2013 at 9:13 AM, Mohit Vadhera <project.linux.proj@gmail.com> wrote:
>>>>>>>>> Ok Steve, I am forwarding my issue again to the list you
>>>>>>>>> mentioned. The version is Hadoop 2.0.0-cdh4.1.2.
>>>>>>>>>
>>>>>>>>> Hi,
>>>>>>>>>
>>>>>>>>> The namenode switches into safe mode when it has low disk space
>>>>>>>>> on the root fs /, and I have to run a command manually to leave
>>>>>>>>> it. Below are the log messages for low space on the root / fs.
>>>>>>>>> Is there any parameter so that I can reduce the reserved amount?
>>>>>>>>>
>>>>>>>>> 2013-01-21 01:22:52,217 WARN org.apache.hadoop.hdfs.server.namenode.NameNodeResourceChecker: Space available on volume '/dev/mapper/vg_lv_root' is 10653696, which is below the configured reserved amount 104857600
>>>>>>>>> 2013-01-21 01:22:52,218 WARN org.apache.hadoop.hdfs.server.namenode.FSNamesystem: NameNode low on available disk space. Entering safe mode.
>>>>>>>>> 2013-01-21 01:22:52,218 INFO org.apache.hadoop.hdfs.StateChange: STATE* Safe mode is ON.
>>>>>>>>>
>>>>>>>>> On Wed, Jan 23, 2013 at 2:50 AM, Steve Loughran <steve.loughran@gmail.com> wrote:
>>>>>>>>>> user@hadoop.apache.org list

-- 
Harsh J
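Harsh's closing point, that the NameNodeResourceChecker reports raw byte counts, can be verified against the numbers quoted in the thread with plain arithmetic. This is an illustrative sketch only, not Hadoop code:

```python
# Sanity-check of the byte values from the NameNodeResourceChecker log lines
# quoted in this thread (plain arithmetic; not part of Hadoop itself).

reserved_bytes = 104857600   # configured dfs.namenode.resource.du.reserved
available_bytes = 10653696   # "Space available on volume ..." from the log

MB = 1024 * 1024

print(reserved_bytes == 100 * MB)        # the default reserve is exactly 100 MB
print(round(available_bytes / MB, 1))    # reported free space, expressed in MB
print(available_bytes < reserved_bytes)  # the condition that triggers safe mode
```

So the log's 104857600 is exactly the 100 MB default, while the reported 10653696 bytes is only around 10 MB, which is why the checker fired even though `df` showed 14 GB free at other times.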
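For reference, the property Harsh describes is set in hdfs-site.xml roughly as follows. The property name and its 100 MB byte-valued default come from the thread itself; the 52428800 (50 MB) value below is purely illustrative, not a recommendation from anyone in the discussion:

```xml
<!-- Sketch: lowering the NameNode resource checker threshold.
     The value is in bytes; 52428800 (= 50 MB) is illustrative only. -->
<property>
  <name>dfs.namenode.resource.du.reserved</name>
  <value>52428800</value>
</property>
```

As noted in the thread, the NN must be restarted after the change for it to take effect.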