Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6781D730C for ; Mon, 26 Sep 2011 19:15:10 +0000 (UTC) Received: (qmail 24296 invoked by uid 500); 26 Sep 2011 19:15:09 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 24237 invoked by uid 500); 26 Sep 2011 19:15:08 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 24229 invoked by uid 99); 26 Sep 2011 19:15:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Sep 2011 19:15:08 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jdcryans@gmail.com designates 209.85.220.169 as permitted sender) Received: from [209.85.220.169] (HELO mail-vx0-f169.google.com) (209.85.220.169) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Sep 2011 19:15:03 +0000 Received: by vcbfo14 with SMTP id fo14so4396025vcb.14 for ; Mon, 26 Sep 2011 12:14:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; bh=EC8sfMR4FuvGguy2IiZfmukbonSOi7oCV921ij9oSF4=; b=cncfNjfzqNI+yoUXQ+ElmrKdjsayA9KTFbFkrHswHDxwLtUe0FeF+0RkYSiKNNg1om zLrm6eRBfZBQoFh6OMSGetCnKz/0RTQ7NA1BgZ6bNzQSy5ViGDFkH2PgslSNtyvADhtf hEAjwk3Z8aNHXyVYIPTmZC4FltI8NfVDIplx4= MIME-Version: 1.0 Received: by 10.52.20.227 with SMTP id q3mr6377879vde.23.1317064482893; Mon, 26 Sep 2011 12:14:42 -0700 (PDT) Sender: jdcryans@gmail.com Received: by 10.52.108.164 with HTTP; Mon, 26 Sep 2011 12:14:42 -0700 (PDT) In-Reply-To: <6C5C1804772DB944BA88A0DC48D338DA0B44EE7D@dct-mail.sanjose.telcontar.com> References: <6C5C1804772DB944BA88A0DC48D338DA0B44EE7D@dct-mail.sanjose.telcontar.com> Date: Mon, 26 Sep 2011 12:14:42 -0700 X-Google-Sender-Auth: 86PgWQKmN3k6Yz5Y4VyrmG7U-FE Message-ID: Subject: Re: SocketChannel exception From: Jean-Daniel Cryans To: user@hbase.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable (late answer, sorry) 60 seconds is a long time to be left hanging, my first guess would be to look for signs of GC in that region server and make sure you don't swap at all. You could go further and jstack the RS while it's hanging, maybe you'll see that all the handlers are busy, but even then if they take that long to process it might be due what I previously mentioned. J-D On Tue, Sep 20, 2011 at 11:19 AM, Rohit Nigam wrote: > Hi Guys > > I am getting this exception while running the job in the reducer phase. > The reducer =A0retrieves =A0a data structure =A0for a key from hbase =A0a= nd then > populates it and put =A0it back again. The exception says like this :-- > > > > java.net.SocketTimeoutException: Call to x server /xx.xx.xx.xxx:60020 > failed on socket timeout exception: java.net.SocketTimeoutException: > 60000 millis timeout while waiting for channel to be ready for read. ch > : java.nio.channels.SocketChannel[connected local=3D/yy.yy.yy.yyy:44751 > remote=3D x server /xx.xx.xx.xxx:60020] > > java.net.SocketTimeoutException: 20000 millis timeout while waiting for > channel to be ready for connect. ch : > java.nio.channels.SocketChannel[connection-pending > remote=3Ddoop18.dt.sv4.decarta.com/10.241.8.238:60020] > > java.net.SocketTimeoutException: 20000 millis timeout while waiting for > channel to be ready for connect. ch : > java.nio.channels.SocketChannel[connection-pending > remote=3Ddoop18.dt.sv4.decarta.com/10.241.8.238:60020] > > java.net.SocketTimeoutException: 20000 millis timeout while waiting for > channel to be ready for connect. ch : > java.nio.channels.SocketChannel[connection-pending > remote=3Ddoop18.dt.sv4.decarta.com/10.241.8.238:60020] > > java.net.SocketTimeoutException: 20000 millis timeout while waiting for > channel to be ready for connect. ch : > java.nio.channels.SocketChannel[connection-pending > remote=3Ddoop18.dt.sv4.decarta.com/10.241.8.238:60020] > > java.net.SocketTimeoutException: 20000 millis timeout while waiting for > channel to be ready for connect. ch : > java.nio.channels.SocketChannel[connection-pending > remote=3Ddoop18.dt.sv4.decarta.com/10.241.8.238:60020] > > java.net.SocketTimeoutException > > > > There is a lot of IO involved in the reducer phase as to talking with > HBASE. Would really appreciate if somebody can shed some light as to > when can this exception happen. Is it related to regionservers being too > busy to cater the request? > > > > Thanks > > Rohit > > > >