Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 55B99179D5 for ; Thu, 10 Sep 2015 00:42:26 +0000 (UTC) Received: (qmail 56956 invoked by uid 500); 10 Sep 2015 00:42:24 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 56881 invoked by uid 500); 10 Sep 2015 00:42:24 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 56780 invoked by uid 99); 10 Sep 2015 00:42:24 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 10 Sep 2015 00:42:23 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 8C9B51A0A7E for ; Thu, 10 Sep 2015 00:42:23 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.9 X-Spam-Level: ** X-Spam-Status: No, score=2.9 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 12l0jolElxWz for ; Thu, 10 Sep 2015 00:42:18 +0000 (UTC) Received: from mail-wi0-f181.google.com (mail-wi0-f181.google.com [209.85.212.181]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id 44981205E9 for ; Thu, 10 Sep 2015 00:42:18 +0000 (UTC) Received: by wicfx3 with SMTP id fx3so4938833wic.0 for ; Wed, 09 Sep 2015 17:42:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=BFG8HRJJyXw6Vy0D8F0w44/1u12QZrddIxm4opriyp4=; b=rhpWxgrUEdwkbBinAbV/JJ3CUBPj1p4NpE2KUts71+VMEQaujk4h+RpuIpRUI0FH6u vle9taufQfS72/kJxPpxyskw4975Z71unoliMvxrw847DSaBSumLxUK9X4nIXVknUbIG +rJB42rCeaZBddb9FuvWORUhKU51Q9R01Z0GtnwNAp6i/eSRwU1acw3YVWR1q/lKzQEO N6Z7ol9b5nW+W+DadeC+YdG7oErVBAigc8Neak60YVowILfK4GH5hNIxDwMFc2Od5tvd A3PNe1oGkmDY6arBxgyHEkXBPzlhZADOsACgrhBGDdVluiLdZxIHhBwJnxAHOGbWUETo NXuA== MIME-Version: 1.0 X-Received: by 10.194.114.37 with SMTP id jd5mr61921138wjb.34.1441845736740; Wed, 09 Sep 2015 17:42:16 -0700 (PDT) Received: by 10.194.123.165 with HTTP; Wed, 9 Sep 2015 17:42:16 -0700 (PDT) In-Reply-To: References: Date: Wed, 9 Sep 2015 20:42:16 -0400 Message-ID: Subject: Re: strange replication problem From: Hbase Janitor To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a1130cd1af59eb0051f59defb --001a1130cd1af59eb0051f59defb Content-Type: text/plain; charset=UTF-8 Hi, It seems similar. The infinite loop sounds right because the region server doesn't seem to want to come down cleanly. On Wed, Sep 9, 2015 at 2:23 PM, Rendon, Carlos (KBB - Irvine) < Carlos.Rendon@kbb.com> wrote: > We've been seeing a similar issue. > Take a look at https://issues.apache.org/jira/browse/HBASE-13471 and see > if it matches what you are seeing. > > -Carlos > > -----Original Message----- > From: Hbase Janitor [mailto:hbasejanitor@gmail.com] > Sent: Monday, September 07, 2015 11:05 AM > To: user@hbase.apache.org > Subject: Re: strange replication problem > > I wanted to add that the message just happen once or twice, it's flooding > the logs. > > We had to stop replication and stop the region server to stop it. > > On Mon, Sep 7, 2015 at 2:00 PM, Ted Yu wrote: > > > WrongRegionException is retriable. Meaning the client would retry upon > > receiving the exception. > > > > Did you observe any failed job(s) ? > > > > > > Cheers > > > > On Mon, Sep 7, 2015 at 10:54 AM, Hbase Janitor > > > > wrote: > > > > > Ted, thank you for your response. > > > > > > Yes, there was. Looks like it moved yesterday morning. > > > > > > On Mon, Sep 7, 2015 at 1:47 PM, Ted Yu wrote: > > > > > > > Was there region movement prior to 11:24:00 (on the region server > > > > where WrongRegionException was observed) ? > > > > > > > > Cheers > > > > > > > > On Mon, Sep 7, 2015 at 9:59 AM, Hbase Janitor > > > > > > > > wrote: > > > > > > > > > Hi, > > > > > > > > > > We've recently upgraded to hbase 1.0 and we are seeing a strange > > error > > > in > > > > > the region logs: > > > > > > > > > > 2015-09-07 11:24:00,960 WARN > > > > org.apache.hadoop.hbase.regionserver.HRegion: > > > > > Failed getting lock in batch put, row=HMV14395619228 > > > > > > > > > > org.apache.hadoop.hbase.regionserver.WrongRegionException: > > > > > Requested > > > row > > > > > out of range for row lock on HRegion region-name., > > > > startKey='ZMA178741312', > > > > > getEndKey()='ZMY253622718', row='HMV14395619228' > > > > > > > > > > at > > > > > > > > org.apache.hadoop.hbase.regionserver.HRegion.checkRow(HRegion.java:4 > > > 731) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.HRegion.getRowLockInternal(HRegio > > n.java:4762) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.HRegion.doMiniBatchMutation(HRegi > > on.java:2877) > > > > > > > > > > at > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.HRegion.batchMutate(HRegion.java: > > 2741) > > > > > > > > > > at > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.HRegion.batchMutate(HRegion.java: > > 2676) > > > > > > > > > > at > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.HRegion.batchMutate(HRegion.java: > > 2680) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.RSRpcServices.doBatchOp(RSRpcServ > > ices.java:653) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.RSRpcServices.doNonAtomicRegionMu > > tation(RSRpcServices.java:615) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.regionserver.RSRpcServices.multi(RSRpcServices > > .java:1901) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$ > > 2.callBlockingMethod(ClientProtos.java:31451) > > > > > > > > > > at > > > > org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2035) > > > > > > > > > > at > > > > org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:107) > > > > > > > > > > at > > > > > > > > > > > > > > > > > > > org.apache.hadoop.hbase.ipc.RpcExecutor.consumerLoop(RpcExecutor.java: > > 130) > > > > > > > > > > at > > > > > org.apache.hadoop.hbase.ipc.RpcExecutor$1.run(RpcExecutor.java:1 > > > > > 07) > > > > > > > > > > at java.lang.Thread.run(Thread.java:745) > > > > > > > > > > > > > > > Any idea what causes it? > > > > > > > > > > > > > > > --001a1130cd1af59eb0051f59defb--