Date: Sat, 24 Nov 2007 16:21:47 -0800
Subject: Re: Reply: HBase PerformanceEvaluation failing
From: Ted Dunning <tdunning@veoh.com>
To: <hadoop-user@lucene.apache.org>


I think stack was suggesting an HDFS fsck, not a disk-level fsck.

Try [hadoop fsck /]
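
If you end up running that check repeatedly while shrinking the cluster, a small wrapper might save some typing. The following is only a rough sketch, not anything shipped with Hadoop: the function names fsck_status and run_fsck are made up for illustration, and it assumes the fsck report contains a summary line with the word HEALTHY or CORRUPT, which may differ between Hadoop versions.

```python
import shutil
import subprocess


def fsck_status(report: str) -> str:
    """Classify the text printed by `hadoop fsck <path>`.

    Assumption: the report ends with a summary line containing
    HEALTHY or CORRUPT. Anything else is reported as UNKNOWN.
    """
    # Scan from the bottom so the final summary line wins over
    # any per-file messages printed earlier in the report.
    for line in reversed(report.splitlines()):
        if "HEALTHY" in line:
            return "HEALTHY"
        if "CORRUPT" in line:
            return "CORRUPT"
    return "UNKNOWN"


def run_fsck(path: str = "/") -> str:
    """Run `hadoop fsck <path>` and return its status.

    Returns UNKNOWN when the hadoop launcher is not on PATH.
    """
    if shutil.which("hadoop") is None:
        return "UNKNOWN"
    result = subprocess.run(
        ["hadoop", "fsck", path],
        capture_output=True,
        text=True,
    )
    return fsck_status(result.stdout)
```

Calling run_fsck("/") from a node that has the hadoop script on its PATH should give HEALTHY, CORRUPT, or UNKNOWN. This checks HDFS block health only and needs no root access, unlike a disk-level fsck.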


On 11/24/07 4:09 PM, "Kareem Dana" <kareem.dana@gmail.com> wrote:

> I do not have root access on the xen cluster I'm using. I will ask the
> admin to make sure the disk is working properly. Regarding the
> mismatched versions though, are you suggesting that different region
> servers might be running different versions of hbase/hadoop? They are
> all running the same code from the same shared storage. There isn't
> even another version of hadoop anywhere for the other nodes to run. I
> think I'll try dropping my cluster down to 2 nodes and working back
> up... maybe I can pinpoint a specific problem node. Thanks for taking
> a look at my logs.
> 
> On Nov 24, 2007 5:49 PM, stack <stack@duboce.net> wrote:
>> I took a quick look, Kareem. As with last time, HBase keeps having
>> trouble with HDFS. Things start out fine around 16:00, then go bad
>> because it can't write reliably to HDFS, for a variety of reasons. You
>> then seem to restart the cluster around 17:37 or so, and things seem to
>> go along fine for a while until 19:05, when again all regionservers
>> report trouble writing to HDFS. Have you run an fsck?