From: Mayuran Yogarajah
Date: Wed, 12 Aug 2009 13:01:44 -0400
To: "common-user@hadoop.apache.org"
Subject: Re: NN + secondary got full, even though data nodes had plenty of space
Message-ID: <4A82F578.7050309@casalemedia.com>

Todd Lipcon wrote:
> Hi Mayuran,
>
> Do you do all of your uploads of data into your Hadoop cluster from node001
> and node002?
>
> If so, keep in mind that one of your replicas will always be written on
> localhost in the case that it is part of the cluster.
>
> You should consider running the rebalancer to even up your space usage.
>
> -Todd
>

Actually yes I have been doing this. I'll try rebalancer, thanks for your help.

M

> On Tue, Aug 11, 2009 at 11:09 AM, Mayuran Yogarajah
> <mayuran.yogarajah@casalemedia.com> wrote:
>
>> I have a 6 node cluster running Hadoop 0.18.3. I'm trying to figure out
>> how the data was spread out like this:
>>
>> node001 94.15%
>> node002 94.16%
>> node003 48.22%
>> node004 47.85%
>> node005 48.12%
>> node006 43.18%
>>
>> Node 001 (NN) and node 002 (secondary NN) both got full, while the other
>> data nodes had more space left. I had assumed that Hadoop would distribute
>> more blocks to nodes 3-6 since they had much more space, but it ended up
>> filling up nodes 1 and 2. Is this expected?
>>
>> thanks,
>> M
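
For anyone reading this thread later, here is a rough sketch of why uploading only from node001 and node002 skews usage the way Todd describes. It is a toy model, not HDFS code: the function name `simulate_placement` and all of its parameters are made up for illustration, the replication factor of 3 is an assumption (the thread never states it), and the remaining replicas are placed uniformly at random, ignoring rack awareness and free-space checks.

```python
import random


def simulate_placement(num_nodes=6, writers=(0, 1), num_blocks=100_000,
                       replication=3, seed=0):
    """Count replicas per node in a simplified placement model.

    Assumptions (not taken from the thread): replication factor 3, uploads
    alternate between the writer nodes, and the non-local replicas go to
    uniformly random other nodes.
    """
    rng = random.Random(seed)
    counts = [0] * num_nodes
    for i in range(num_blocks):
        # The first replica always lands on the node doing the upload.
        writer = writers[i % len(writers)]
        counts[writer] += 1
        # The remaining replicas go to distinct nodes other than the writer.
        others = [n for n in range(num_nodes) if n != writer]
        for node in rng.sample(others, replication - 1):
            counts[node] += 1
    return counts


if __name__ == "__main__":
    counts = simulate_placement()
    total = sum(counts)
    for node, count in enumerate(counts, start=1):
        print(f"node{node:03d} {100 * count / total:5.2f}% of replicas")
```

Under these assumptions each uploading node expects 0.5 * 1 + 0.5 * 0.4 = 0.7 replicas per block, while every other node expects 0.4, so the two upload nodes fill roughly 1.75 times as fast. That is in the same direction as the 94% versus 48% figures quoted above, though the real numbers also depend on disk sizes and non-HDFS usage, which this sketch does not model.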