Date: Sun, 23 Dec 2012 09:50:12 +0800
Subject: Re: responseTooSlow
From: Azuryy Yu
To: user@hbase.apache.org
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary=bcaec54ee77624c7fb04d17b4d1c

--bcaec54ee77624c7fb04d17b4d1c
Content-Type: text/plain; charset=ISO-8859-1

I am sure you have a long GC pause; please monitor your GC log.

On Dec 22, 2012 8:14 PM, "Mohammad Tariq" wrote:
> yeah
>
> Best Regards,
> Tariq
> +91-9741563634
> https://mtariq.jux.com/
>
>
> On Sat, Dec 22, 2012 at 6:57 AM, Mohit Anchlia wrote:
>
> > You mean batch multiple put?
> >
> > On Fri, Dec 21, 2012 at 4:16 PM, Mohammad Tariq wrote:
> >
> > > It might be the RS which could not complete the operation in time. The
> > > appropriate way to find out is to monitor that RS's metrics and see if
> > > anything unusual is happening there. What type of keys are you using? Is
> > > it time-series data? You might be a victim of RS hotspotting in that
> > > case, or perhaps some other processes are eating up resources there. Try
> > > using "put(List<Put> puts)" instead of "put(Put put)" and see if it makes
> > > any difference.
> > >
> > > I'm afraid I can't say anything with 100% confidence, as there could be
> > > 'n' reasons which are not traceable from here.
> > > Some of the possible reasons could be:
> > > hotspotting region
> > > too much I/O wait due to swapping
> > > overloaded disk
> > > slowness due to high CPU consumption
> > >
> > > Best Regards,
> > > Tariq
> > > +91-9741563634
> > > https://mtariq.jux.com/
> > >
> > >
> > > On Sat, Dec 22, 2012 at 5:23 AM, Mohit Anchlia wrote:
> > >
> > > > I am just doing a put. This operation generally takes 10 ms, but in this
> > > > case it took more than 10 sec. Nothing out of the ordinary in the logs.
> > > >
> > > > On Fri, Dec 21, 2012 at 3:26 PM, Mohammad Tariq wrote:
> > > >
> > > > > What exactly is the operation you're trying to do? How is your
> > > > > network's health? Is swapping too high on the RS side? Anything odd
> > > > > in your RS logs?
> > > > >
> > > > > Best Regards,
> > > > > Tariq
> > > > > +91-9741563634
> > > > > https://mtariq.jux.com/
> > > > >
> > > > >
> > > > > On Sat, Dec 22, 2012 at 4:36 AM, Mohit Anchlia <mohitanchlia@gmail.com> wrote:
> > > > >
> > > > > > I looked at that link, but couldn't find anything useful. How do I
> > > > > > check whether it was the client that didn't write data within that
> > > > > > time, or the region server that didn't finish the operation in time?
> > > > > >
> > > > > > On Fri, Dec 21, 2012 at 2:54 PM, Mohammad Tariq <dontariq@gmail.com> wrote:
> > > > > >
> > > > > > > The socket through which your client is communicating is getting
> > > > > > > closed before the operation can finish. Maybe it is taking longer
> > > > > > > than usual or something.
> > > > > > >
> > > > > > > Best Regards,
> > > > > > > Tariq
> > > > > > > +91-9741563634
> > > > > > > https://mtariq.jux.com/
> > > > > > >
> > > > > > >
> > > > > > > On Sat, Dec 22, 2012 at 4:08 AM, Mohammad Tariq <dontariq@gmail.com> wrote:
> > > > > > >
> > > > > > > > Hello Mohit,
> > > > > > > >
> > > > > > > > You might find this link useful:
> > > > > > > > http://hbase.apache.org/book/ops.monitoring.html
> > > > > > > >
> > > > > > > > Best Regards,
> > > > > > > > Tariq
> > > > > > > > +91-9741563634
> > > > > > > > https://mtariq.jux.com/
> > > > > > > >
> > > > > > > >
> > > > > > > > On Sat, Dec 22, 2012 at 2:09 AM, Mohit Anchlia <mohitanchlia@gmail.com> wrote:
> > > > > > > >
> > > > > > > >> Could someone help me understand what this really means? Is this
> > > > > > > >> the network transfer taking long from client -> server, or the
> > > > > > > >> region server taking a long time writing to memory?
> > > > > > > >>
> > > > > > > >> 2012-12-21 10:54:21,980 WARN org.apache.hadoop.ipc.HBaseServer: (responseTooSlow): {"processingtimems":135652,"call":"multi(org.apache.hadoop.hbase.client.MultiAction@28338472), rpc version=1, client version=29, methodsFingerPrint=54742778","client":"10.18.3.80:48218","starttimems":1356115926326,"queuetimems":0,"class":"HRegionServer","responsesize":0,"method":"multi"}
> > > > > > > >> 2012-12-21 10:54:21,985 WARN org.apache.hadoop.ipc.HBaseServer: IPC Server handler 26 on 60020 caught: java.nio.channels.ClosedChannelException
> > > > > > > >>     at sun.nio.ch.SocketChannelImpl.ensureWriteOpen(SocketChannelImpl.java:133)
> > > > > > > >>     at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:324)
> > > > > > > >>     at org.apache.hadoop.hbase.ipc.HBaseServer.channelWrite(HBaseServer.java:1653)
> > > > > > > >>     at org.apache.hadoop.hbase.ipc.HBaseServer$Responder.processResponse(HBaseServer.java:924)
> > > > > > > >>     at org.apache.hadoop.hbase.ipc.HBaseServer$Responder.doRespond(HBaseServer.java:1003)
> > > > > > > >>     at org.apache.hadoop.hbase.ipc.HBaseServer$Call.sendResponseIfReady(HBaseServer.java:409)
> > > > > > > >>     at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1346)
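
For what it's worth, the part after "(responseTooSlow):" in the log line quoted above is plain JSON, so the timing fields can be pulled out with a few lines of Python. This is only a rough sketch: the helper names (parse_too_slow, summarize) are made up for illustration, and my reading of the fields (queuetimems = time waiting for a handler, processingtimems = time spent inside the handler) is an assumption based on the field names, so please check it against your HBase version.

```python
import json

# Sample line copied from the region server log quoted above.
SAMPLE = ('2012-12-21 10:54:21,980 WARN org.apache.hadoop.ipc.HBaseServer: '
          '(responseTooSlow): {"processingtimems":135652,"call":"multi('
          'org.apache.hadoop.hbase.client.MultiAction@28338472), rpc version=1, '
          'client version=29, methodsFingerPrint=54742778",'
          '"client":"10.18.3.80:48218","starttimems":1356115926326,'
          '"queuetimems":0,"class":"HRegionServer","responsesize":0,'
          '"method":"multi"}')

MARKER = "(responseTooSlow): "


def parse_too_slow(line):
    """Return the JSON payload of a responseTooSlow line as a dict, or None."""
    idx = line.find(MARKER)
    if idx < 0:
        return None  # not a responseTooSlow line
    return json.loads(line[idx + len(MARKER):])


def summarize(payload):
    """Keep only the fields that help tell queueing apart from processing."""
    return {
        "method": payload["method"],
        "client": payload["client"],
        "processing_s": payload["processingtimems"] / 1000.0,
        "queue_s": payload["queuetimems"] / 1000.0,
    }


summary = summarize(parse_too_slow(SAMPLE))
print(summary)
```

If that reading of the fields is right, a queue time of zero next to a processing time of over two minutes would suggest the call was not waiting for a free handler and the time went into the region server handling it, which fits the ClosedChannelException: the client had probably given up and closed the socket by the time the response was ready.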
--bcaec54ee77624c7fb04d17b4d1c--
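
On the "monitor your GC log" suggestion at the top of this message, here is a rough sketch of one way to look for long pauses. It assumes the region server JVM is run with -XX:+PrintGCDetails, which makes HotSpot append "[Times: user=... sys=..., real=... secs]" to each collection. The function name long_pauses, the 5-second threshold, and the two sample lines are made up for illustration; they are not taken from the poster's cluster.

```python
import re

# Matches the wall-clock figure in the "[Times: user=... sys=..., real=... secs]"
# suffix that HotSpot prints when -XX:+PrintGCDetails is enabled.
REAL_RE = re.compile(r"real=(\d+(?:\.\d+)?) secs")


def long_pauses(lines, threshold_s=5.0):
    """Yield (line_number, real_seconds, line) for collections at or above threshold_s."""
    for n, line in enumerate(lines, start=1):
        m = REAL_RE.search(line)
        if m and float(m.group(1)) >= threshold_s:
            yield n, float(m.group(1)), line.rstrip("\n")


# Made-up sample lines, for illustration only.
SAMPLE_LOG = [
    "100.1: [GC ... [Times: user=0.21 sys=0.01, real=0.03 secs]",
    "105.3: [Full GC ... [Times: user=1.50 sys=0.20, real=12.50 secs]",
]

hits = list(long_pauses(SAMPLE_LOG))
for n, secs, line in hits:
    print("line %d: %.2f s pause: %s" % (n, secs, line))
```

To run it against a real log, pass an open file object instead of SAMPLE_LOG. Pauses whose timestamps line up with the responseTooSlow warnings would support the long-GC explanation; if nothing lines up, the other causes listed in the thread (hotspotting, swapping, disk, CPU) are worth checking next.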