Return-Path: Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: (qmail 54181 invoked from network); 28 Aug 2009 20:41:53 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 28 Aug 2009 20:41:53 -0000 Received: (qmail 13577 invoked by uid 500); 28 Aug 2009 20:41:53 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 13512 invoked by uid 500); 28 Aug 2009 20:41:53 -0000 Mailing-List: contact hdfs-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-user@hadoop.apache.org Delivered-To: mailing list hdfs-user@hadoop.apache.org Received: (qmail 13503 invoked by uid 99); 28 Aug 2009 20:41:53 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 28 Aug 2009 20:41:53 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dhruba@gmail.com designates 209.85.211.191 as permitted sender) Received: from [209.85.211.191] (HELO mail-yw0-f191.google.com) (209.85.211.191) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 28 Aug 2009 20:41:45 +0000 Received: by ywh29 with SMTP id 29so3110226ywh.33 for ; Fri, 28 Aug 2009 13:41:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=+mZvSGFhM32Z6CgD8y/SJmE+tW16oiTT5rFGs3XN5sI=; b=q7GESoY0caBiUavoVwuGTzNZrbkr1eQ4K+gPssSS6nnj/j9oKM86vgUq+4XoPLED+t 9dJoN4mwXLO8Uj194m3Pu7zu2XMKuRUbBK7j1kIlYaq5yVZgrxOgruX7Vc+CJKUer99R RjkR2wptR9dtXWELU+y//SgD7FV09zgzg3E4I= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=J/o18F/4VsalyWmxjDP1iSGIRR0xO5Z0Uhca+/4UVMtEwDRObSRkxAufS+Did26AOz BLqngLkqOWMJEDmb0CH5ncmwbQELIY0IETt5oFGp//SzT3MRc/8bbmp1P08Zkp+Ttw3t T9QiP5zuHzlkOzax2ltni/xOf70CHQLGNLDLM= MIME-Version: 1.0 Received: by 10.150.59.4 with SMTP id h4mr1440580yba.235.1251492084581; Fri, 28 Aug 2009 13:41:24 -0700 (PDT) In-Reply-To: References: Date: Fri, 28 Aug 2009 13:41:24 -0700 Message-ID: <4aa34eb70908281341j4959f21rf2363b4e5816fb58@mail.gmail.com> Subject: Re: Intelligence of decommission From: Dhruba Borthakur To: hdfs-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=000e0cd6a8082406db047239b54b X-Virus-Checked: Checked by ClamAV on apache.org --000e0cd6a8082406db047239b54b Content-Type: text/plain; charset=ISO-8859-1 when a user issues the decommission command, all the blocks that are currently residing on it are inserted into the to-be-replicated queue. Then the ReplicationMonitor inside the namenode starts replicating these blocks (during this period, the replica on the machine being decommissioned is used for reads, but is not considered a valid replica by the ReplicationMonitor). hope this helps, dhruba On Fri, Aug 28, 2009 at 1:07 PM, Allen Wittenauer wrote: > Hi. > > As I sit here and wait for node decommission to finish, I was wondering > about the intelligence of the decision making. [The name nodes, not mine. > :) ] > > Let's say I have the following scenario: > > I have two files. Both files consist of one block with a replication > factor > of three. I decommission two nodes. File #1 has two of its replicas on > the > two nodes I am decommissioning. File #2 has only one of its replicas on > one > of the two nodes I am decommissioning. > > Is the block with two replicas on the two nodes I am decommissioning given > priority? How does the name node decide which blocks to re-replicate > first? > > Thanks. > > --000e0cd6a8082406db047239b54b Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable when a user issues the decommission command, all the blocks that are curren= tly residing on it are inserted into the to-be-replicated queue. Then the R= eplicationMonitor inside the namenode starts replicating these blocks (duri= ng this period, the replica on the machine being decommissioned is used for= reads, but is not considered a valid replica by the ReplicationMonitor).
hope this helps,
dhruba


On Fri= , Aug 28, 2009 at 1:07 PM, Allen Wittenauer <awittenauer@linkedin.com> wrote:
Hi.

As I sit here and wait for node decommission to finish, I was wondering
about the intelligence of the decision making. =A0[The name nodes, not mine= .
:) ]

Let's say I have the following scenario:

I have two files. =A0Both files consist of one block with a replication fac= tor
of three. =A0I decommission two nodes. =A0File #1 has two of its replicas o= n the
two nodes I am decommissioning. =A0File #2 has only one of its replicas on = one
of the two nodes I am decommissioning.

Is the block with two replicas on the two nodes I am decommissioning given<= br> priority? =A0How does the name node decide which blocks to re-replicate fir= st?

Thanks.


--000e0cd6a8082406db047239b54b--