Subject: Re: Application errors with one disk on datanode getting filled up to 100%
From: Mayank <mail2mayank@gmail.com>
To: user@hadoop.apache.org
Date: Mon, 10 Jun 2013 15:18:38 +0530

No, it's not a map-reduce job. We've a Java app running on around 80 machines which writes to HDFS. The error that I'd mentioned is being thrown by the application, and yes, we've got the replication factor set to 3. The following is the status of HDFS:

Configured Capacity : 16.15 TB
DFS Used : 11.84 TB
Non DFS Used : 872.66 GB
DFS Remaining : 3.46 TB
DFS Used% : 73.3 %
DFS Remaining% : 21.42 %
Live Nodes : 10
Dead Nodes : 0
Decommissioning Nodes : 0
Number of Under-Replicated Blocks : 0
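
For context, the "All datanodes ... are bad" error quoted further down is raised by the HDFS client's own write pipeline, i.e. inside whatever code path the app uses to write. Below is a minimal sketch of such a write path, with a made-up class name, namenode URI and output path; the actual application code is not part of this thread.

import java.net.URI;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// Placeholder class name; illustrative only.
public class HdfsWriteSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Hypothetical namenode URI; in practice it comes from fs.default.name in core-site.xml.
        FileSystem fs = FileSystem.get(URI.create("hdfs://namenode.example.com:9000"), conf);
        FSDataOutputStream out = fs.create(new Path("/app/output/sample.txt"));
        try {
            // Writes go through the client-side pipeline (the DFSOutputStream seen in the stack trace below).
            out.write("sample record\n".getBytes("UTF-8"));
        } finally {
            // "All datanodes ... are bad. Aborting..." surfaces on write() or close()
            // once the client has run out of healthy datanodes for the current block.
            out.close();
        }
        fs.close();
    }
}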


On Mon, Jun 10, 2013 at 3:11 PM, Nitin Pawar <nitinpawar432@gmail.com> wrote:
When you say the application errors out, does that mean your mapreduce job is erroring? In that case, apart from HDFS space, you will need to look at the mapred tmp directory space as well.

You've got 400 GB * 4 * 10 = 16 TB of disk, and let's assume that you have a replication factor of 3, so at most you will have a data size of about 5 TB with you.
I am also assuming you are not scheduling your program to run on the entire 5 TB with just 10 nodes.

I suspect your cluster's mapred tmp space is getting filled up while the job is running.
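
(A rough cross-check against the capacity figures above: 414 GB x 4 disks x 10 nodes is roughly 16.2 TB raw, consistent with the reported 16.15 TB configured capacity, and dividing by 3 replicas gives roughly 5.4 TB of usable data space.)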




On Mon, Jun 10, 2013 at 3:06 PM, Mayank <mail2mayank@gmail.com> wrote:
We are running a Hadoop cluster with 10 datanodes and a namenode. Each datanode is set up with 4 disks (/data1, /data2, /data3, /data4), with each disk having a capacity of 414 GB.


hdfs-site.xml has the following property set:

<property>
        <name>dfs.data.dir</name>
        <value>/data1/hadoopfs,/data2/hadoopfs,/data3/hadoopfs,/data4/hadoopfs</value>
        <description>Data dirs for DFS.</description>
</property>

Now we are facing an issue wherein we find /data1 getting filled up quickly, and many a time we see its usage running at 100% with just a few megabytes of free space. This issue is visible on 7 out of 10 datanodes at present.

We've some Java applications which are writing to HDFS, and many a time we are seeing the following errors in our application logs:
java.io.IOException: All datanodes xxx.xxx.xxx.xxx:50010 are bad. Aborting...
	at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:3093)
	at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$2200(DFSClient.java:2586)
	at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2790)

I went through some old discussions, and it looks like manual rebalancing is what is required in this case, and we should also have dfs.datanode.du.reserved set up.
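
For reference, dfs.datanode.du.reserved is set per volume in hdfs-site.xml, alongside the dfs.data.dir property above; the value is in bytes, and the 10 GB figure below is only an illustrative placeholder, not a recommendation from this thread:

<property>
        <name>dfs.datanode.du.reserved</name>
        <!-- Illustrative placeholder: reserve ~10 GB per volume for non-DFS use -->
        <value>10737418240</value>
        <description>Space in bytes to reserve on each volume for non-DFS use.</description>
</property>

Note that the stock balancer (hadoop balancer) only evens usage out across datanodes, not across the disks within a single datanode, which is why per-disk skew like this generally called for manual block moves on this generation of Hadoop.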

However, I'd like to understand if this issue, with one disk getting filled up to 100%, can result in the errors which we are seeing in our application.

Also, are there any other performance implications due to some of the disks running at 100% usage on a datanode?
--
Mayank Joshi

Skype: mail2mayank
Mb.: +91 8690625808

Blog: http://www.techynfreesouls.co.nr
PhotoStream: http://picasaweb.google.com/mail2mayank

Today is tomorrow I was so worried about yesterday ...



--
Nitin Pawar



--
Mayank Joshi

Skype: mail2mayank
Mb.: +91 8690625808

Blog: http://www.techynfreesouls.co.nr
PhotoStream: http://picasaweb.google.com/mail2mayank

Today is tomorrow I was so worried about yesterday ...