Date: Tue, 10 Dec 2013 13:45:52 -0500
Subject: Re: Table state
From: "Kevin O'dell"
To: "user@hbase.apache.org"

Just to close the loop: the previously recommended steps helped get us back
up, but one of the HMasters is not happy now. I will update with a final
analysis shortly.

On Tue, Dec 10, 2013 at 1:10 PM, Jean-Marc Spaggiari <jean-marc@spaggiari.org> wrote:

> Also, it might be interesting to look in the RS logs to see why this region
> cannot come back online...
>
> JM
>
>
> 2013/12/10 Kevin O'dell
>
> > Hey Raheem,
> >
> > You can sideline the table into /tmp (mv /hbase/table /tmp/table), then
> > bring HBase back online. Once HBase is back you can use HBCK to repair
> > your META with -fixMeta -fixAssignments. Once HBase is consistent again,
> > you can move the table back out of /tmp and use HBCK to re-update META.
> > If the issue reoccurs, let us know.
> >
> >
> > On Tue, Dec 10, 2013 at 11:50 AM, Daya, Raheem wrote:
> >
> > > I have a distributed HBase cluster that will not start. It looks like
> > > there is a table that is in an inconsistent state:
> > > 2013-12-10 07:40:50,447 FATAL org.apache.hadoop.hbase.master.HMaster:
> > > Unexpected state :
> > > ct_claims,204845|81V6SO4EF56DD1TKOIU7AS4L5D,1386050670937.6d138b97cde8bc3e49ff34639913109c.
> > > state=PENDING_OPEN, ts=1386690050445, server=rhf-045,60020,1386689069486
> > > .. Cannot transit it to OFFLINE.
> > >
> > > Is there a way to manually set the table to OFFLINE? I have tried
> > > deleting the /hbase node in ZooKeeper. I tried bringing up the master
> > > and then a region server, and vice versa. When I bring the master up
> > > first, the master starts. As soon as I bring up a region server, the
> > > master goes down. My thought is to move the tables to OFFLINE (assuming
> > > it is possible) and try bringing up the cluster again. hbck will not
> > > work, as none of the region servers are up. Anyone have any other ideas?
> > > Thanks,
> > > Raheem
> >
> > --
> > Kevin O'Dell
> > Systems Engineer, Cloudera
>

--
Kevin O'Dell
Systems Engineer, Cloudera
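
For anyone who finds this thread later, the sideline procedure quoted above can be written out as an ordered command plan. This is only a rough sketch under stated assumptions, not a tested runbook: the function name `sideline_plan` and its parameters are made up for illustration, it assumes the HBase root directory lives in HDFS (so `hadoop fs -mv` stands in for the plain `mv` in the thread), and it assumes the final "re-update META" step reuses the same hbck flags as the first repair. It only builds and prints the commands; it does not run anything.

```python
import shlex


def sideline_plan(table, hbase_root="/hbase", sideline_root="/tmp"):
    """Return the recovery steps from the thread as a list of argv lists.

    Dry run only: nothing is executed. All names here are illustrative.
    """
    src = f"{hbase_root}/{table}"      # table directory under the HBase root
    dst = f"{sideline_root}/{table}"   # sideline location
    hbck = ["hbase", "hbck", "-fixMeta", "-fixAssignments"]
    return [
        # 1. With HBase stopped, move the table directory out of the way.
        #    Assumption: the root is in HDFS, hence `hadoop fs -mv`.
        ["hadoop", "fs", "-mv", src, dst],
        # 2. After bringing HBase back online, repair META and assignments.
        list(hbck),
        # 3. Once hbck reports the cluster consistent, move the table back.
        ["hadoop", "fs", "-mv", dst, src],
        # 4. Re-register the table in META (assumption: same flags as step 2).
        list(hbck),
    ]


if __name__ == "__main__":
    # "ct_claims" is the table name taken from the region in the FATAL log line.
    for argv in sideline_plan("ct_claims"):
        print(" ".join(shlex.quote(arg) for arg in argv))
```

Starting HBase between steps 1 and 2 is deliberately left out of the plan, since how the master and region servers are started depends on the deployment. Review each printed command against your own layout before running it by hand.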