Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1ECEB10788 for ; Mon, 21 Oct 2013 08:17:57 +0000 (UTC) Received: (qmail 87843 invoked by uid 500); 21 Oct 2013 08:17:50 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 87717 invoked by uid 500); 21 Oct 2013 08:17:50 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 87701 invoked by uid 99); 21 Oct 2013 08:17:45 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 21 Oct 2013 08:17:45 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ahmic.samir@gmail.com designates 209.85.223.169 as permitted sender) Received: from [209.85.223.169] (HELO mail-ie0-f169.google.com) (209.85.223.169) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 21 Oct 2013 08:17:40 +0000 Received: by mail-ie0-f169.google.com with SMTP id ar20so11230591iec.28 for ; Mon, 21 Oct 2013 01:17:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=pbqE4oUOvrIhK58puIGlPpgHW/dBx1YiE539+dspLuM=; b=p4nJ0UCCOg8VMkOZQ5nSMSTT5ZDzhz20YQfmniaR8N/o5n6uNs/+FyrhNU/1hKvQpF sxUK7vCzD+Jpgx4mBxq7/rkB4ME/M/OKqNQvT6pEEdhTefltMJUDF4z+QV8CXDOqA6V9 DeydS//+S5uxKLhbjw/8qTeAtY3xF4oriWJk50G6y2zRmj6kRIyDCtVujgLtFBRmbl/G h+Uygq3kYxVITEb1Qd0dtYHxL3kD7+AwzOv1yiuO82GAsYN+SrEqILznl+QBOKbgvoH/ lbG4ETxRgspBOSiCDgA64+m/idEI/ZhW1lasPOtqKu1HlUBrXjXVUgDeDSge742uUXSh LGlQ== MIME-Version: 1.0 X-Received: by 10.50.16.45 with SMTP id c13mr8295047igd.55.1382343437890; Mon, 21 Oct 2013 01:17:17 -0700 (PDT) Received: by 10.64.69.40 with HTTP; Mon, 21 Oct 2013 01:17:17 -0700 (PDT) In-Reply-To: <5264DD74.3000701@post.km.ru> References: <5264DD74.3000701@post.km.ru> Date: Mon, 21 Oct 2013 10:17:17 +0200 Message-ID: Subject: Re: CDH4.4 and HBASE-8912 issue From: Samir Ahmic To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=047d7bdca43092bc7804e93be917 X-Virus-Checked: Checked by ClamAV on apache.org --047d7bdca43092bc7804e93be917 Content-Type: text/plain; charset=ISO-8859-1 Hi, Boris Did you check RS logs ? There should be exception regarding why assignment failed. Can you past that exception ? Cheers :) On Mon, Oct 21, 2013 at 9:53 AM, Boris Emelyanov wrote: > >Boris, what does hbck say? > > > >We have had this issue a couple times before. To fix it I had to stop the cluster, run offline meta repair tool, > >delete zk-store on each zk quorum node > >Offline Meta repair tool will not work if there are inconsistencies in HBase - you better try hbase hbck > >-fixAll first. > > > >Best regards, > >Vladimir Rodionov > >Principal Platform Engineer > >Carrier IQ, www.carrieriq.com > > >e-mail: vrodionov@... > > Hbck says "0 inconsistencies detected". > I stopped hbase cluster, deleted zk-database on all quorum nodes, ran "hbase org.apache.hadoop.hbase.util.hbck.OfflineMetaRepair", > and got "INFO util.HBaseFsck: Success! .META. table rebuilt.". > After that, cluster continued crashing during auto-loadbalancing. > > > > -- > Best regards, > > Boris Emelyanov. > > --047d7bdca43092bc7804e93be917 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi, Boris

Did you check RS logs ? There= should be exception regarding why assignment failed. Can you past that exc= eption ?

Cheers :)=A0


On Mon, Oct 21, 2013 at 9:53 AM, Boris E= melyanov <emelyanov@post.km.ru> wrote:
=20 =20 =20
=20
>Boris, what does hbck say? > >We have had this issue a couple times before. To fix it I had to stop t= he cluster, run offline meta repair tool, >delete zk-store on each zk quorum node >Offline Meta repair tool will not work if there are inconsistencies i= n HBase - you better try hbase hbck >-fixAll first. > >Best regards, >Vladimir Rodionov >Principal Platform Engineer >Carrier IQ, www.carrieriq.com
>e-mail: v= rodionov@... Hbck says "0 inconsistencies detected".=20 I stopped hbase cluster, deleted zk-database on all quorum nodes, ran "= ;hbase org.apache.hadoop.hbase.util.hbck.OfflineMetaRepair", and got "INFO util.HBaseFsck: Success! .META. table rebuilt.". After that, cluster continued crashing during auto-loadbalancing.
--=20
Best regards,

Boris Emelyanov.

--047d7bdca43092bc7804e93be917--