Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 81743 invoked from network); 25 Nov 2007 00:22:22 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 Nov 2007 00:22:22 -0000 Received: (qmail 89763 invoked by uid 500); 25 Nov 2007 00:22:08 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 89743 invoked by uid 500); 25 Nov 2007 00:22:08 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 89734 invoked by uid 99); 25 Nov 2007 00:22:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 24 Nov 2007 16:22:08 -0800 X-ASF-Spam-Status: No, hits=2.8 required=10.0 tests=RCVD_IN_DNSWL_LOW,RCVD_NUMERIC_HELO,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [69.50.2.13] (HELO ex9.myhostedexchange.com) (69.50.2.13) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Nov 2007 00:21:48 +0000 Received: from 75.80.179.210 ([75.80.179.210]) by ex9.hostedexchange.local ([69.50.2.13]) with Microsoft Exchange Server HTTP-DAV ; Sun, 25 Nov 2007 00:21:48 +0000 User-Agent: Microsoft-Entourage/11.3.3.061214 Date: Sat, 24 Nov 2007 16:21:47 -0800 Subject: Re: =?Big5?B?tarOYA==?=: HBase PerformanceEvaluation failing From: Ted Dunning To: Message-ID: Thread-Topic: =?Big5?B?tarOYA==?=: HBase PerformanceEvaluation failing Thread-Index: Acgu+So2aJTUHZrsEdySBAAWy8rVfQ== In-Reply-To: <3bb76bfe0711241609o7453e0cdv3f0a030bfc141791@mail.gmail.com> Mime-version: 1.0 Content-type: text/plain; charset="US-ASCII" Content-transfer-encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org I think that stack was suggesting an HDFS fsck, not a disk level fsck. Try [hadoop fsck /] On 11/24/07 4:09 PM, "Kareem Dana" wrote: > I do not have root access on the xen cluster I'm using. I will ask the > admin to make sure the disk is working properly. Regarding the > mismatch versions though, are you suggesting that different region > servers might be running different versions of hbase/hadoop? They are > all running the same code from the same shared storage. There isn't > even another version of hadoop anywhere for the other nodes to run. I > think I'll try dropping my cluster down to 2 nodes and working back > up... maybe I can pin point a specific problem node. Thanks for taking > a look at my logs. > > On Nov 24, 2007 5:49 PM, stack wrote: >> I took a quick look Kareem. As with the last time, hbase keeps having >> trouble w/ the hdfs. Things start out fine around 16:00 then go bad >> because can't write reliably to the hdfs -- a variety of reasons. You >> then seem to restart the cluster around 17:37 or so and things seem to >> go along fine for a while until 19:05 when again, all regionservers >> report trouble writing the hdfs. Have you run an fsck?