Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 920AC10D12 for ; Wed, 23 Oct 2013 13:17:11 +0000 (UTC) Received: (qmail 79311 invoked by uid 500); 23 Oct 2013 13:17:05 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 79186 invoked by uid 500); 23 Oct 2013 13:17:04 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 79155 invoked by uid 99); 23 Oct 2013 13:17:04 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Oct 2013 13:17:04 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of julian.zhou@me.com designates 17.158.161.2 as permitted sender) Received: from [17.158.161.2] (HELO nk11p00mm-asmtp003.mac.com) (17.158.161.2) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Oct 2013 13:16:57 +0000 Received: from nk11p00mm-spool004.mac.com ([17.158.161.119]) by nk11p00mm-asmtp003.mac.com (Oracle Communications Messaging Server 7u4-27.08(7.0.4.27.7) 64bit (built Aug 22 2013)) with ESMTP id <0MV400EHUI7L5D10@nk11p00mm-asmtp003.mac.com>; Wed, 23 Oct 2013 13:16:35 +0000 (GMT) X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.10.8794,1.0.431,0.0.0000 definitions=2013-10-23_03:2013-10-23,2013-10-23,1970-01-01 signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 suspectscore=2 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=7.0.1-1308280000 definitions=main-1310230041 MIME-version: 1.0 Content-type: multipart/alternative; boundary="Boundary_(ID_IMpu4JFXhw1QtpGzUy6TYw)" Received: from localhost ([17.158.233.92]) by nk11p00mm-spool004.mac.com (Oracle Communications Messaging Server 7u4-27.08(7.0.4.27.7) 64bit (built Aug 22 2013)) with ESMTP id <0MV400E6HI7M5SE0@nk11p00mm-spool004.mac.com>; Wed, 23 Oct 2013 13:16:34 +0000 (GMT) To: user@hbase.apache.org Cc: dev@hbase.apache.org, user@hbase.apache.org, issues@hbase.apache.org From: Julian Zhou Subject: Re: HConnectionImplementation.listTables and list table exception Date: Wed, 23 Oct 2013 13:16:18 +0000 (GMT) X-Mailer: iCloud MailClient1T.111546 MailServer1T X-Originating-IP: [202.108.130.138] Message-id: In-reply-to: X-Virus-Checked: Checked by ClamAV on apache.org --Boundary_(ID_IMpu4JFXhw1QtpGzUy6TYw) Content-type: text/plain; charset=utf-8; format=flowed Content-transfer-encoding: quoted-printable Hello Michelle, How many regions totally are there in your 600 nodes cluster? Looks lik= e many of them are pending for open and being assigned to region servers. Can you see many items under zookeeper dir /hbase/unassigned? =EF=BB=BF You would like to refer http://blog.sina.com.cn/s/blog_4a1f59bf01018tu4.ht= ml? Best Regards, Julian On Oct 23, 2013, at 01:46 PM, =E5=BC=A0=E8=8E=89=E8=8B=B9 wrote: > Dear HBase dev and users, > > Did you meet this > "org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementa= tion.listTables" > issue? > > We setup a 600 nodes cluster, 9 zookeeper nodes to load data into hbase, > but it seemed hbase master was busy handling transition with zookeeper, = and > hbase =E2=80=9Clist=E2=80=9D could not get response. The hbase table was= created but it > didn't do any insert. > > Do you have any idea of the root cause and how to fix it? :)Highly > appreciate for your answers! > > > > Here is the exception stack: > --------------------------------------------------- > java.lang.reflect.UndeclaredThrowableException > at $Proxy7.getHTableDescriptors(Unknown Source) > at > org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementat= ion.listTables(HConnectionManager.java:2237) > at > org.apache.hadoop.hbase.client.HBaseAdmin.listTables(HBaseAdmin.java:317= ) > > > > > hbase master log: > > ----------------------------- > > 2013-10-18 06:19:41,279 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign= : > master:60000-0x341be88202300ab* Deleting existing unassigned node* for > 0ec3308bd1e2bdd9576b2d60d2eee68e that is in expected state > RS_ZK_REGION_OPENED > > 2013-10-18 06:19:41,279 DEBUG > org.apache.hadoop.hbase.master.AssignmentManager:* Handling > transition=3DRS_ZK_REGION_OPENING*, s*erver=3Dnode0878*. > ic.analyticsworkbench.com,60020,1381883086785, > region=3D15a4fb29aa1d905b13f33594e50bc8de, which is more than 15 seconds= late > > 2013-10-18 06:19:41,280 DEBUG > org.apache.hadoop.hbase.master.AssignmentManager: *Handling > transition=3DRS_ZK_REGION_OPENING, > server=3Dnode0898*.ic.analyticsworkbench.com,60020,1381883200494, > region=3D1a4c929534e6828c85f22b062f949304, which is more than 15 seconds= late > > 2013-10-18 06:19:41,289 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign= : > master:60000-0x341be88202300ab Successfully *deleted unassigned node *fo= r > region 0ec3308bd1e2bdd9576b2d60d2eee68e in expected state > RS_ZK_REGION_OPENED > > 2013-10-18 06:19:41,289 DEBUG > org.apache.hadoop.hbase.master.AssignmentManager: Handling > transition=3DRS_ZK_REGION_OPENING, > server=3Dnode0693.ic.analyticsworkbench.com,60020,1381881773670, > region=3Dd47bfe1af0051c405de295a51c1c6e63, which is more than 15 seconds= late > > > > We also try to "list" in hbase shell,it also failed: > > The hbase =E2=80=9Clist=E2=80=9D got error as: > > ------------------------------------------ > > > > hbase(main):001:0> list > > TABLE > > > > > ERROR: java.lang.reflect.UndeclaredThrowableException: Call to > node0997.ic.analyticsworkbench.com/10.1.50.17:60000 failed on socket > timeout exception: java.net.SocketTimeoutException: 120000 millis timeou= t > while waiting for channel to be ready for read. ch : > java.nio.channels.SocketChannel[connected local=3D/10.1.50.15:45726 remo= te=3D > node0997.ic.analyticsworkbench.com/10.1.50.17:60000] > > > > > Cheers, > ----- > Big Data - Big Wisdom - Big Value > -------------- > Michelle Zhang (Li Ping Zhang) --Boundary_(ID_IMpu4JFXhw1QtpGzUy6TYw) Content-type: multipart/related; boundary="Boundary_(ID_Nnw8Y7S/jYlGQbVIhg94nA)"; type="text/html" --Boundary_(ID_Nnw8Y7S/jYlGQbVIhg94nA) Content-type: text/html; charset=utf-8 Content-transfer-encoding: quoted-printable
Hello Michelle,
   How many regions totally are there in= your 600 nodes cluster? Looks like many of them are pending for open and = being assigned to region servers.
Can you see many items under zookeepe= r dir /hbase/unassigned?

You would like to refer http://blog.sina.c= om.cn/s/blog_4a1f59bf01018tu4.html?
Best Regards, Julian

On Oct 23, 2013, at 01:46 PM, =E5=BC=A0=E8=8E=89=E8=8B=B9 <zl= pmichelle@gmail.com> wrote:

Dear HBase dev and users,
=
Did you meet this
"org.apache.hadoop.hbase.client.HConnectionMana= ger$HConnectionImplementation.listTables"
issue?

We setup a 6= 00 nodes cluster, 9 zookeeper nodes to load data into hbase,
but it se= emed hbase master was busy handling transition with zookeeper, and
hba= se =E2=80=9Clist=E2=80=9D could not get response. The hbase table was crea= ted but it
didn't do any insert.

Do you have any idea of the = root cause and how to fix it? :)Highly
appreciate for your answers!


Here is the exception stack:
------------------------= ---------------------------
java.lang.reflect.UndeclaredThrowableExcep= tion
at $Proxy7.getHTableDescriptors(Unknown Source)
at
org.ap= ache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.list= Tables(HConnectionManager.java:2237)
at
org.apache.hadoop.hbase.cl= ient.HBaseAdmin.listTables(HBaseAdmin.java:317)




hb= ase master log:

-----------------------------

2013-10-18= 06:19:41,279 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign:
master= :60000-0x341be88202300ab* Deleting existing unassigned node* for
0ec33= 08bd1e2bdd9576b2d60d2eee68e that is in expected state
RS_ZK_REGION_OPE= NED

2013-10-18 06:19:41,279 DEBUG
org.apache.hadoop.hbase.mas= ter.AssignmentManager:* Handling
transition=3DRS_ZK_REGION_OPENING*, s= *erver=3Dnode0878*.
ic.analyticsworkbench.com,60020,1381883086785,
= region=3D15a4fb29aa1d905b13f33594e50bc8de, which is more than 15 seconds = late

2013-10-18 06:19:41,280 DEBUG
org.apache.hadoop.hbase.ma= ster.AssignmentManager: *Handling
transition=3DRS_ZK_REGION_OPENING, server=3Dnode0898*.ic.analyticsworkbench.com,60020,1381883200494,
r= egion=3D1a4c929534e6828c85f22b062f949304, which is more than 15 seconds la= te

2013-10-18 06:19:41,289 DEBUG org.apache.hadoop.hbase.zookeepe= r.ZKAssign:
master:60000-0x341be88202300ab Successfully *deleted unass= igned node *for
region 0ec3308bd1e2bdd9576b2d60d2eee68e in expected st= ate
RS_ZK_REGION_OPENED

2013-10-18 06:19:41,289 DEBUG
org= .apache.hadoop.hbase.master.AssignmentManager: Handling
transition=3DR= S_ZK_REGION_OPENING,
server=3Dnode0693.ic.analyticsworkbench.com,60020= ,1381881773670,
region=3Dd47bfe1af0051c405de295a51c1c6e63, which is mo= re than 15 seconds late



We also try to "list" in hbase = shell,it also failed:

The hbase =E2=80=9Clist=E2=80=9D got error = as:

------------------------------------------



= hbase(main):001:0> list

TABLE




ERROR: = java.lang.reflect.UndeclaredThrowableException: Call to
node0997.ic.an= alyticsworkbench.com/10.1.50.17:60000 failed on socket
timeout excepti= on: java.net= .SocketTimeoutException: 120000 millis timeout
while waiting for c= hannel to be ready for read. ch :
java.nio.channels.SocketChannel[conn= ected local=3D/10.1.50.15:45726 remote=3D
node0997.ic.analyticsworkben= ch.com/10.1.50.17:60000]




Cheers,
-----
Big= Data - Big Wisdom - Big Value
--------------
Michelle Zhang (Li P= ing Zhang)
= --Boundary_(ID_Nnw8Y7S/jYlGQbVIhg94nA)-- --Boundary_(ID_IMpu4JFXhw1QtpGzUy6TYw)--