From: Azuryy Yu <azuryyyu@gmail.com>
To: user@hadoop.apache.org
Date: Wed, 3 Apr 2013 09:53:18 +0800
Subject: Re: are we able to decommission multi nodes at one time?

bq. then namenode start to copy block replicates on DN-2 to another DN, supposed DN-2.

Sorry for the typo. Correction: then the NameNode starts to copy the block replicas on DN-1 to another DN, say DN-2.

On Wed, Apr 3, 2013 at 9:51 AM, Azuryy Yu wrote:
> It's different.
> If you just want to stop DN-1 for a short time, just kill the DataNode
> process on DN-1, then do what you want. During this time the NameNode
> cannot receive heartbeats from DN-1, so it starts to copy the block
> replicas on DN-2 to another DN, say DN-2.
>
> But when you start DN-1 again, the NameNode receives DN-1's
> registration and stops copying DN-1's block replicas, even if it
> hasn't finished copying.
>
> Did I explain that clearly?
>
>
> On Wed, Apr 3, 2013 at 9:43 AM, Henry Junyoung Kim wrote:
>
>> @Harsh
>>
>> What causes the big gap in removal time between decommissioning nodes
>> and simply taking them down?
>> In my understanding, both need to copy under-replicated blocks to
>> other live nodes.
>> If that is the main cost of both, the total elapsed time shouldn't be
>> very different.
>>
>> Could you share some articles or documents explaining the
>> decommissioning procedure?
>> - explanations are always appreciated ;)
>>
>>
>> On Apr 2, 2013, at 5:37 PM, Harsh J <harsh@cloudera.com> wrote:
>>
>> > Yes, you can do the downtime work in steps of 2 DNs at a time,
>> > especially since you mentioned the total work would be only ~30
>> > mins at most.
>> >
>> > On Tue, Apr 2, 2013 at 1:46 PM, Henry Junyoung Kim
>> > <henry.jykim@gmail.com> wrote:
>> >> the rest of the nodes staying alive have enough space to store the
>> >> data.
>> >>
>> >> for this one that you've mentioned:
>> >>> it's easier to do so in a rolling manner without need of a
>> >>> decommission.
>> >>
>> >> to check my understanding: just shut down 2 of them, then 2 more,
>> >> and then 2 more, without decommissions.
>> >>
>> >> is this correct?
>> >>
>> >>
>> >> On Apr 2, 2013, at 4:54 PM, Harsh J <harsh@cloudera.com> wrote:
>> >>
>> >>> Note though that it's only possible to decommission 7 nodes at
>> >>> the same time and expect it to finish iff the remaining 8 nodes
>> >>> have adequate free space for the excess replicas.
>> >>>
>> >>> If you're just going to take them down for a short while (a few
>> >>> minutes each), it's easier to do so in a rolling manner without
>> >>> need of a decommission. You can take up to two down at a time at
>> >>> a replication average of 3 or more, and put them back in later
>> >>> without too much data-movement impact.
>> >>>
>> >>> On Tue, Apr 2, 2013 at 1:06 PM, Yanbo Liang
>> >>> <yanbohappy@gmail.com> wrote:
>> >>>> It's reasonable to decommission 7 nodes at the same time.
>> >>>> But it may also take a long time to finish, because all the
>> >>>> replicas on these 7 nodes need to be copied to the remaining 8
>> >>>> nodes.
>> >>>> The amount of data transferred from these nodes to the remaining
>> >>>> nodes is the same either way.
>> >>>>
>> >>>>
>> >>>> 2013/4/2 Henry Junyoung Kim <henry.jykim@gmail.com>
>> >>>>>
>> >>>>> :)
>> >>>>>
>> >>>>> currently, I have 15 data nodes.
>> >>>>> for some tests, I am trying to decommission down to 8 nodes.
>> >>>>>
>> >>>>> Now, the total dfs used size is 52 TB, which includes all
>> >>>>> replicated blocks.
>> >>>>> from 15 to 8, the total time spent is almost 4 days. ;(
>> >>>>>
>> >>>>> someone mentioned that I don't need to decommission node by
>> >>>>> node.
>> >>>>> in this case, are there no problems if I decommission 7 nodes
>> >>>>> at the same time?
>> >>>>>
>> >>>>>
>> >>>>> On Apr 2, 2013, at 12:14 PM, Azuryy Yu <azuryyyu@gmail.com>
>> >>>>> wrote:
>> >>>>>
>> >>>>> I can translate it to native English: how many nodes do you
>> >>>>> want to decommission?
>> >>>>>
>> >>>>>
>> >>>>> On Tue, Apr 2, 2013 at 11:01 AM, Yanbo Liang
>> >>>>> <yanbohappy@gmail.com> wrote:
>> >>>>>>
>> >>>>>> You want to decommission how many nodes?
>> >>>>>>
>> >>>>>>
>> >>>>>> 2013/4/2 Henry JunYoung KIM <henry.jykim@gmail.com>
>> >>>>>>>
>> >>>>>>> 15 for datanodes and 3 for replication factor.
>> >>>>>>>
>> >>>>>>> On Apr 1, 2013, at 3:23 PM, varun kumar
>> >>>>>>> <varun.uid@gmail.com> wrote:
>> >>>>>>>
>> >>>>>>>> How many nodes do you have, and what is the replication
>> >>>>>>>> factor?
>> >>>>>>>
>> >>>>>>
>> >>>>>
>> >>>>>
>> >>>>
>> >>>
>> >>>
>> >>>
>> >>> --
>> >>> Harsh J
>> >>
>> >
>> >
>> >
>> > --
>> > Harsh J
>>
>>
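A back-of-envelope check of the numbers discussed in this thread. This is only a sketch under the assumption (not stated in the thread) that data is spread evenly across DataNodes; the figures come from the thread itself: 52 TB of DFS used (including replication), 15 DataNodes, replication factor 3, 7 nodes to decommission.

```python
# Estimate the re-replication load when decommissioning 7 of 15 DataNodes.
# Assumption: blocks are spread evenly across all DataNodes.

TOTAL_USED_TB = 52.0   # total DFS used, including all replicas (from thread)
TOTAL_NODES = 15
DECOMMISSIONED = 7
REMAINING = TOTAL_NODES - DECOMMISSIONED  # 8 survivors

# Data held per node under even spread.
per_node_tb = TOTAL_USED_TB / TOTAL_NODES

# Everything on the 7 leaving nodes must be re-replicated elsewhere.
to_rereplicate_tb = per_node_tb * DECOMMISSIONED

# Extra data each surviving node absorbs, on average.
per_survivor_extra_tb = to_rereplicate_tb / REMAINING

# Harsh's "iff" condition: the 8 survivors must be able to hold all 52 TB,
# i.e. 6.5 TB per node after the decommission completes.
per_survivor_total_tb = TOTAL_USED_TB / REMAINING

print(round(per_node_tb, 2),
      round(to_rereplicate_tb, 2),
      round(per_survivor_extra_tb, 2),
      per_survivor_total_tb)
# -> 3.47 24.27 3.03 6.5
```

So roughly 24 TB has to move over the network regardless of whether the 7 nodes are decommissioned one at a time or all at once, which is consistent with Yanbo's point that the total transfer size is the same either way.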