From: Aitor Cedres <acedres@pivotal.io>
To: user@hadoop.apache.org
Date: Wed, 8 Oct 2014 12:14:03 +0100
Subject: Re: Datanode volume full, but not moving to free volume

Hi Brian,

I would try to move the Block Pool directories (BP-1408773897-172.17.1.1-1400769841207). You must shut down your DataNode process before doing this operation.

Regards,

Aitor Cedrés

On 8 October 2014 11:46, Brian C. Huffman wrote:

> Can I move a whole subdir? Or does it have to be individual block files
> / metadata?
>
> For example, I see this:
> [hadoop@thor1 finalized]$ pwd
> /data/data2/hadoop/yarn_data/hdfs/datanode/current/BP-1408773897-172.17.1.1-1400769841207/current/finalized
> [hadoop@thor1 finalized]$ du -sh subdir10/
> 80G    subdir10/
>
> So could I move subdir10 to the same location under /data/data3?
>
> Thanks,
> Brian
>
> Brian C. Huffman
> System Administrator
> ET International, Inc.
>
> On 10/8/14, 4:44 AM, Aitor Cedres wrote:
>
>> Hi Brian,
>>
>> Hadoop does not balance the disks within a DataNode. If you ran out of
>> space and then added additional disks, you should shut down the DataNode
>> and manually move a few files to the new disk.
>>
>> Regards,
>>
>> Aitor Cedrés
>>
>> On 6 October 2014 14:46, Brian C. Huffman wrote:
>>
>>> All,
>>>
>>> I have a small Hadoop cluster (2.5.0) with 4 datanodes and 3 data disks
>>> per node. Lately some of the volumes have been filling up, but instead of
>>> moving to other configured volumes that *have* free space, it's giving
>>> errors in the datanode logs:
>>>
>>> 2014-10-03 11:52:44,989 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode:
>>> thor2.xmen.eti:50010:DataXceiver error processing WRITE_BLOCK
>>> operation  src: /172.17.1.3:35412 dst: /172.17.1.2:50010
>>> java.io.IOException: No space left on device
>>>     at java.io.FileOutputStream.writeBytes(Native Method)
>>>     at java.io.FileOutputStream.write(FileOutputStream.java:345)
>>>     at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.receivePacket(BlockReceiver.java:592)
>>>     at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.receiveBlock(BlockReceiver.java:734)
>>>     at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:741)
>>>     at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:124)
>>>     at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:71)
>>>     at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:234)
>>>     at java.lang.Thread.run(Thread.java:745)
>>>
>>> Unfortunately it's continuing to try to write, and when it fails, it's
>>> passing the exception to the client.
>>>
>>> I did a restart and then it seemed to figure out that it should move to
>>> the next volume.
>>>
>>> Any suggestions to keep this from happening in the future?
>>>
>>> Also, could it be an issue that I have a small amount of non-HDFS data
>>> on those volumes?
>>>
>>> Thanks,
>>> Brian
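The move Aitor describes can be sketched as a shell session. The block-pool ID and subdir10 are taken from Brian's example; the hadoop-daemon.sh commands and the temp-dir scaffolding (standing in for the real /data/data2 and /data/data3 mounts) are illustrative assumptions, not a tested procedure for this cluster:

```shell
set -eu

# Temp dir standing in for the real mount points (paths hypothetical).
ROOT=$(mktemp -d)
BP=BP-1408773897-172.17.1.1-1400769841207
SRC=$ROOT/data2/hadoop/yarn_data/hdfs/datanode/current/$BP/current/finalized
DST=$ROOT/data3/hadoop/yarn_data/hdfs/datanode/current/$BP/current/finalized

# Fake a subdir on the full source disk for demonstration.
mkdir -p "$SRC/subdir10"
touch "$SRC/subdir10/blk_1073741825" "$SRC/subdir10/blk_1073741825_1001.meta"

# 1. Stop the DataNode before touching block files (command assumed;
#    adjust for your install): $HADOOP_HOME/sbin/hadoop-daemon.sh stop datanode

# 2. Recreate the same block-pool path on the new disk, then move the whole
#    subdir; block files and their .meta files travel together.
mkdir -p "$DST"
mv "$SRC/subdir10" "$DST/"

# 3. Restart the DataNode; it rescans its volumes on startup:
#    $HADOOP_HOME/sbin/hadoop-daemon.sh start datanode

ls "$DST/subdir10"   # lists blk_1073741825 and its .meta file
```

The key point is step 2: the destination must mirror the full current/<block-pool>/current/finalized path on the new volume, so the DataNode finds the blocks under the same pool after restart.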