Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 36277BB28 for ; Wed, 18 Jan 2012 17:34:39 +0000 (UTC) Received: (qmail 96299 invoked by uid 500); 18 Jan 2012 17:34:37 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 96183 invoked by uid 500); 18 Jan 2012 17:34:36 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 96175 invoked by uid 99); 18 Jan 2012 17:34:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Jan 2012 17:34:36 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of saint.ack@gmail.com designates 209.85.214.169 as permitted sender) Received: from [209.85.214.169] (HELO mail-tul01m020-f169.google.com) (209.85.214.169) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Jan 2012 17:34:28 +0000 Received: by obbta7 with SMTP id ta7so5781124obb.14 for ; Wed, 18 Jan 2012 09:34:07 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; bh=zlDO0lO7SaMSLgbmcENnQELLcsD03WcdWQf0FKHlTPk=; b=Qh81OCNELhTt9iW05RXtrWE83FLpclyfc3qDkApTYNGJTqpoy30Pt45mw789akMLHX dAJrAte+8s4KWld4QanrFKrArOqgfdYa6lbxuA4e6poPAsTFcKm13E1dgyQvhYKPGSgD QH9r01BNaC8M42Td41rUfls0N52AlY/7FJHkU= MIME-Version: 1.0 Received: by 10.182.38.7 with SMTP id c7mr13519044obk.44.1326908047695; Wed, 18 Jan 2012 09:34:07 -0800 (PST) Sender: saint.ack@gmail.com Received: by 10.182.246.99 with HTTP; Wed, 18 Jan 2012 09:34:07 -0800 (PST) In-Reply-To: References: Date: Wed, 18 Jan 2012 09:34:07 -0800 X-Google-Sender-Auth: _R_xqI0Wfsi3hAIskzy4TzowU-c Message-ID: Subject: Re: Table region got stuck, doesn't move/assign From: Stack To: user@hbase.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org On Mon, Jan 16, 2012 at 3:45 PM, Dmitriy Lyubimov wrote: > i have a table which seems to get stuck in a state where it can't be > queried, moved or split/compacted. > How many regions in this table? One only? > The logs don't have any error statements. Our admin tried hbck to no avail . > What did your admin see? > We stopped the region server, table did not get reassigned. (all other > did). when bround in UI, this table just showed "region server > offline". (??? shouldn't get reassigned as others did?) > Yes. It should. > Brining region server online loaded it with other regions, but not > that table. master apparently still thinks it is on that node (data6) > and so all requests are failing with region not serving message. > So, there is something 'wrong' w/ that table. Can you track it in master log and see what happens when master tries assign it? Maybe its failing to open? > assign/move/ unassign commands have no effect (move fails, but > assing/unassign seems to be quiet with no apparent effect). > > Another weirdness: it's the only table that is showing up under > hbase/table in zk and its region is listed under /hbase/unassigned. > Maybe its stuck in transition? You should see messages in master log if this the case. > Where can i read about meaning and transitions of zookeeper nodes under /hbase ? > I don't think this documented in the reference guide (its a little too much detail for most I'd say). Best place to look is probably source code. See here for an entrance into the wonderful world of master/regionserver state transitions: http://hbase.apache.org/xref/org/apache/hadoop/hbase/executor/EventHandler.html#93 St.Ack