Date: Tue, 15 Jul 2014 10:24:48 +0800
Subject: Re: hbase region servers refuse connection
From: Qiang Tian
To: user@hbase.apache.org

You could also try using separate connections in your client program; see
the related issue HBASE-11306. I suspect that sharing a connection is a
contributing factor for this issue.

On Mon, Jul 14, 2014 at 7:19 PM, Rural Hunter wrote:

> Hi Tian Qiang,
>
> Good to hear that. It seems the JIRA is fixed. I will find time to build
> the latest 0.96 branch and test it.
>
> On 2014/7/14 18:06, Qiang Tian wrote:
>
>> Hi YuMing, :)
>> Yes, several iterations of jstack on the problem regionserver could help
>> identify the problem.
>>
>> Rural,
>> you probably hit HBASE-11277 (and probably YuMing as well): reader
>> thread 14 loops again and again in the stack below (high CPU usage), and
>> listener thread 12 is blocked and cannot accept new connections.
>>
>> Thread 12 (RpcServer.listener,port=60020):
>>   State: BLOCKED
>>   Blocked count: 123264191
>>   Waited count: 0
>>   Blocked on org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader@77f87716
>>   Blocked by 14 (RpcServer.reader=1,port=60020)
>>   Stack:
>>     org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.registerChannel(RpcServer.java:598)
>>     org.apache.hadoop.hbase.ipc.RpcServer$Listener.doAccept(RpcServer.java:755)
>>     org.apache.hadoop.hbase.ipc.RpcServer$Listener.run(RpcServer.java:673)
>> Thread 24 (RpcServer.responder):
>>
>> Thread 14 (RpcServer.reader=1,port=60020):
>>   State: RUNNABLE
>>   Blocked count: 12510492
>>   Waited count: 12826560
>>   Stack:
>>     sun.nio.ch.FileDispatcher.read0(Native Method)
>>     sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
>>     sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:251)
>>     sun.nio.ch.IOUtil.read(IOUtil.java:224)
>>     sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:254)
>>     org.apache.hadoop.hbase.ipc.RpcServer.channelIO(RpcServer.java:2438)
>>     org.apache.hadoop.hbase.ipc.RpcServer.channelRead(RpcServer.java:2404)
>>     org.apache.hadoop.hbase.ipc.RpcServer$Connection.readAndProcess(RpcServer.java:1498)
>>     org.apache.hadoop.hbase.ipc.RpcServer$Listener.doRead(RpcServer.java:780)
>>     org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.doRunLoop(RpcServer.java:568)
>>     org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.run(RpcServer.java:543)
>>     java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
>>     java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>     java.lang.Thread.run(Thread.java:701)
>> Thread 13 (RpcServer.reader=0,port=60020):
>>
>> 2014-07-10 14:13:49,614 WARN [RpcServer.reader=7,port=60020] ipc.RpcServer: RpcServer.listener,port=60020: count of bytes read: 0
>> java.io.IOException: Connection reset by peer
>>     at sun.nio.ch.FileDispatcher.read0(Native Method)
>>     at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
>>     at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:251)
>>     at sun.nio.ch.IOUtil.read(IOUtil.java:224)
>>     at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:254)
>>     at org.apache.hadoop.hbase.ipc.RpcServer.channelRead(RpcServer.java:2404)
>>     at org.apache.hadoop.hbase.ipc.RpcServer$Connection.readAndProcess(RpcServer.java:1425)
>>     at org.apache.hadoop.hbase.ipc.RpcServer$Listener.doRead(RpcServer.java:780)
>>     at org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.doRunLoop(RpcServer.java:568)
>>     at org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.run(RpcServer.java:543)
>>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
>>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>     at java.lang.Thread.run(Thread.java:701)
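
In the quoted thread dump, the listener is BLOCKED on a reader object's monitor while that reader keeps running. Here is a minimal sketch of that locking pattern, assuming the reader holds its own monitor for as long as it stays in its loop, which is what the dump suggests. The class and method names below (BlockedListenerSketch, Reader, runLoop, observeListenerState) are hypothetical stand-ins, not HBase's RpcServer code. Unlike the real reader, which was RUNNABLE and spinning in read, the stand-in simply parks on a latch while holding the lock.

```java
import java.util.concurrent.CountDownLatch;

public class BlockedListenerSketch {

    // Hypothetical stand-in for a reader: both methods lock the same monitor.
    static class Reader {
        final CountDownLatch holding = new CountDownLatch(1);
        final CountDownLatch release = new CountDownLatch(1);

        // Holds the monitor for the whole call, like a reader loop that never returns.
        synchronized void runLoop() {
            holding.countDown();
            try {
                release.await(); // parks without releasing the monitor
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }

        // The listener needs the same monitor to hand over a newly accepted connection.
        synchronized void registerChannel() {
            // registration work would go here
        }
    }

    // Returns the listener thread's state observed while the reader holds the monitor.
    static Thread.State observeListenerState() {
        Reader reader = new Reader();
        Thread readerThread = new Thread(reader::runLoop, "reader");
        Thread listenerThread = new Thread(reader::registerChannel, "listener");
        try {
            readerThread.start();
            reader.holding.await(); // the reader now owns the monitor
            listenerThread.start();
            Thread.State state = listenerThread.getState();
            while (state != Thread.State.BLOCKED) { // poll until the listener is stuck on the monitor
                Thread.sleep(1);
                state = listenerThread.getState();
            }
            reader.release.countDown(); // let the reader finish so both threads can exit
            readerThread.join();
            listenerThread.join();
            return state;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("listener state: " + observeListenerState());
    }
}
```

Running a few jstack iterations against a process in this condition would show the same picture each time: the listener stays BLOCKED on the reader's monitor, and no new connection can be registered until the reader leaves its loop.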