Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@minotaur.apache.org Received: (qmail 89080 invoked from network); 30 Sep 2009 03:46:03 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 30 Sep 2009 03:46:03 -0000 Received: (qmail 28280 invoked by uid 500); 30 Sep 2009 03:46:02 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 28195 invoked by uid 500); 30 Sep 2009 03:46:02 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 28182 invoked by uid 99); 30 Sep 2009 03:46:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Sep 2009 03:46:02 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of arber.research@gmail.com designates 209.85.216.190 as permitted sender) Received: from [209.85.216.190] (HELO mail-px0-f190.google.com) (209.85.216.190) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Sep 2009 03:45:53 +0000 Received: by pxi28 with SMTP id 28so6702650pxi.2 for ; Tue, 29 Sep 2009 20:45:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=fk1j3COQghM2aXSrMNHAoPPJK63mLonnapNayyvvuf0=; b=CWA+uS0Kh4eqgauGLLpiYX4StyvsCOcAUUSiTAlZleWWO3fNJEY1EBO3BDJo8gwM1I 4qiB3QWVV/bmSz5itNmhJQEGKiS3uU+zncydMkDQr0ibk/oEMgnKKSdCE+2iVgsEnRMC UNkvZcDXI5ss/a0Odah3ngFkQfkl94rRFsvwo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=HVWSpPug088KaIaH/XRgw+thGWlhgCXUCSZBoYrz3+Pl5LAPt9JiJC+ETEzj0hJxK8 ft92LJzMf7TqTQXSc6bgQ2E3cT3ZN90OgNghs9OopiYeR2fnwJ/eAChbBGZhvrM5xswk xtNO9cBZdi3kgWCanZ59hdyh7VHPdDwSrIrdI= MIME-Version: 1.0 Received: by 10.142.55.11 with SMTP id d11mr485777wfa.17.1254282332000; Tue, 29 Sep 2009 20:45:32 -0700 (PDT) In-Reply-To: <7c962aed0909292031s69c6f4ccl693d54b0a9fbbbca@mail.gmail.com> References: <382e1efc0909291859u50dc3409q70180f392b9d5b55@mail.gmail.com> <7c962aed0909292019i19f18679l7fd12ad6c17b1c7c@mail.gmail.com> <382e1efc0909292026p1127ec7cq7334f7d1909d3d38@mail.gmail.com> <7c962aed0909292031s69c6f4ccl693d54b0a9fbbbca@mail.gmail.com> Date: Wed, 30 Sep 2009 11:45:31 +0800 Message-ID: <382e1efc0909292045v1418d838l275e4279826831a8@mail.gmail.com> Subject: Re: Some regions locked From: Yabo-Arber Xu To: hbase-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Thanks for your prompt reply. Just now I have restarted the whole cluster to continue my testing, and I will do as you suggested when the lock repeats. Best, Arber On Wed, Sep 30, 2009 at 11:31 AM, stack wrote: > Do you know the region? =A0If so, find which server by grepping region in > master log. =A0Once you have that, thread dump the locked up regionserver= . > Make an issue and attach the thread dump then lets chat on it over there? > Thanks, > St.Ack > > On Tue, Sep 29, 2009 at 8:26 PM, Yabo-Arber Xu = wrote: > >> Seems not a row locked, but the whole region. As any data request to a >> particular block was suspended. Same thing happened on the other two >> tables. Likely =A0a dead lock happened somewhere, but i have no clue >> where to find it... >> >> Best, >> Arber >> >> >> >> On Wed, Sep 30, 2009 at 11:19 AM, stack wrote: >> > So, a row was locked? =A0What do your programs do? =A0Do they take out= a row >> > lock? =A0They do not free it? =A0Maybe an error? >> > St.Ack >> > >> > On Tue, Sep 29, 2009 at 6:59 PM, Yabo-Arber Xu wrote: >> > >> >> Hi there: >> >> >> >> I was running a shell ( in parallel on multiple machines) that >> >> encloses a few programs accessing HBase. At some point all the >> >> programs are all suspended, and later I found the reason was because >> >> some region in HBase is locked. Any program that accesses that region= s >> >> was suspended immediately ( in the call of getScanner ). >> >> >> >> Anybody have ideas what might be the causes? >> >> >> >> Best, >> >> Arber >> >> >> > >> >