Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9501A18A92 for ; Mon, 6 Jul 2015 15:50:18 +0000 (UTC) Received: (qmail 26506 invoked by uid 500); 6 Jul 2015 15:50:16 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 26432 invoked by uid 500); 6 Jul 2015 15:50:16 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 26417 invoked by uid 99); 6 Jul 2015 15:50:16 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Jul 2015 15:50:16 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id B2451D28BA for ; Mon, 6 Jul 2015 15:50:15 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0 X-Spam-Level: X-Spam-Status: No, score=0 tagged_above=-999 required=6.31 tests=[SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id xDTkVdJKsfBG for ; Mon, 6 Jul 2015 15:50:04 +0000 (UTC) Received: from nk11p18im-asmtp001.me.com (nk11p18im-asmtp001.me.com [17.158.120.160]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id C67F120DC7 for ; Mon, 6 Jul 2015 15:50:03 +0000 (UTC) Received: from [192.168.0.7] (ua-83-227-12-104.cust.bredbandsbolaget.se [83.227.12.104]) by nk11p18im-asmtp001.me.com (Oracle Communications Messaging Server 7.0.5.35.0 64bit (built Mar 31 2015)) with ESMTPSA id <0NR20001KPB4RW60@nk11p18im-asmtp001.me.com> for user@hbase.apache.org; Mon, 06 Jul 2015 15:49:55 +0000 (GMT) X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.14.151,1.0.33,0.0.0000 definitions=2015-07-06_07:2015-07-06,2015-07-06,1970-01-01 signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 suspectscore=1 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=7.0.1-1412110000 definitions=main-1507060240 Content-type: text/plain; charset=utf-8 MIME-version: 1.0 (Mac OS X Mail 8.2 \(2098\)) Subject: Re: HBase strange behaviour From: Akmal Abbasov In-reply-to: Date: Mon, 06 Jul 2015 17:49:54 +0200 Content-transfer-encoding: quoted-printable Message-id: <4D642ED6-D32F-4B94-AF28-BC9AFA5DDBB2@icloud.com> References: <3B59357E-5CD8-4918-97A7-0F65142A2067@icloud.com> To: user@hbase.apache.org X-Mailer: Apple Mail (2.2098) > What error(s) did you get when trying to restart the region server ? = Have > you checked its log files ? it was a VM, and I was not able to access it any more, I can=E2=80=99t = login to it. Restarting several times didn=E2=80=99t helped. > Can you check master log around this time ? If there was region in > transition, balancer wouldn't balance. I have a lot of this=20 2015-07-06 15:15:39,918 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/table1-snapshot-31.05.2015_18.= 14/WALs 2015-07-06 15:15:39,918 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/table1-snapshot-31.05.2015_19.= 14/WALs 2015-07-06 15:15:39,921 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/table1-snapshot-31.05.2015_20.= 13/WALs 2015-07-06 15:15:39,925 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/table1-snapshot-31.05.2015_21.= 14/WALs 2015-07-06 15:15:39,926 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/table1-snapshot-31.05.2015_22.= 14/WALs 2015-07-06 15:15:39,927 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/table1-snapshot-31.05.2015_23.= 14/WALs 2015-07-06 15:15:39,928 INFO [snapshot-log-cleaner-cache-refresher] = util.FSVisitor: No logs under = directory:hdfs://test/hbase/.hbase-snapshot/testsnap/WALs 2015-07-06 15:15:47,324 INFO [FifoRpcScheduler.handler1-thread-18] = master.HMaster: Client=3Dhadoop//10.32.0.140 set balanceSwitch=3Dfalse 2015-07-06 15:23:31,265 DEBUG [master:hbase-m2:60000.oldLogCleaner] = master.ReplicationLogCleaner: Didn't find this log in ZK, deleting: = hbase-rs1%2C60020%2C1436189457794.1436190023718 2015-07-06 15:23:31,504 DEBUG [master:hbase-m2:60000.oldLogCleaner] = master.ReplicationLogCleaner: Didn't find this log in ZK, deleting: = hbase-rs1%2C60020%2C1436189457794.1436193624562 2015-07-06 15:32:49,382 INFO [FifoRpcScheduler.handler1-thread-14] = master.HMaster: Client=3Dhadoop//10.32.0.156 set balanceSwitch=3Dfalse 2015-07-06 15:32:56,936 INFO [FifoRpcScheduler.handler1-thread-1] = master.HMaster: Client=3Dhadoop//10.32.0.156 set balanceSwitch=3Dfalse Thank you. > On 06 Jul 2015, at 17:37, Ted Yu wrote: >=20 > bq. I had to delete and recreate it >=20 > What error(s) did you get when trying to restart the region server ? = Have > you checked its log files ? >=20 > bq. start balancer manually, but it returned false >=20 > Can you check master log around this time ? If there was region in > transition, balancer wouldn't balance. >=20 > Cheers >=20 > On Mon, Jul 6, 2015 at 8:29 AM, Akmal Abbasov = > wrote: >=20 >> Hi all, >> I have a strange behaviour in my HBase cluster. I have 5 rs and 2 = masters. >> One of the rs stopped working, restart didn=E2=80=99t worked, and I = had to delete >> and recreate it. >> But when this rs have stopped, the cluster also stopped functioning. >> There were a lot of inconsistencies. When I recreated the rs with = disks of >> the previous one, cluster started working. >> But now, only 3 rs host the regions, other 2 have 0 regions. >> I=E2=80=99ve tried to start balancer manually, but it returned false? >> Any idea? >>=20 >> I am using hbase hbase-0.98.7-hadoop2. >> Thank you. >>=20 >> Kind regards, >> Akmal Abbasov >>=20 >>=20