Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 518BC1056F for ; Mon, 26 Aug 2013 14:28:21 +0000 (UTC) Received: (qmail 6484 invoked by uid 500); 26 Aug 2013 14:28:18 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 6191 invoked by uid 500); 26 Aug 2013 14:28:18 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 6156 invoked by uid 99); 26 Aug 2013 14:28:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Aug 2013 14:28:14 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of olle.martensson@gmail.com designates 209.85.128.46 as permitted sender) Received: from [209.85.128.46] (HELO mail-qe0-f46.google.com) (209.85.128.46) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Aug 2013 14:28:10 +0000 Received: by mail-qe0-f46.google.com with SMTP id f6so1769194qej.5 for ; Mon, 26 Aug 2013 07:27:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=gf+PtV7DZGHzNkcQmj74x500iMPWfiZtGi8jPtnads4=; b=IirmnrBwocoaYBi30qs4zTBLkHOV4pTlJI7feqOGXaeSBtWz6qs44KJ0yyUkUkkZpF g1lHUAYhP9BXpjZ3NiWwlfAaI9OvKw76oABMHpvOMkwHLatdh+8ynLHunxACTl6k6yt8 poP1oI2TEajZmpVXpflDvO7J6AlgH1W7tfea/qqe8istmdFDtZ1zbR1yz2uQSVnJstDY THlz3zl0wxVkTUTyLITMpo9T6xRGygs1mlw0eFPk4Tuq/MF/BlW8QChLdYgG7yMfNTOF lkLLe0HZSn/S0Rzdn2pPRCmdHF/vgacuylfR48/1ITPBAuViuICaPUtrG8czCt5vgdkO 0eag== MIME-Version: 1.0 X-Received: by 10.229.250.5 with SMTP id mm5mr4162638qcb.19.1377527269864; Mon, 26 Aug 2013 07:27:49 -0700 (PDT) Received: by 10.49.101.51 with HTTP; Mon, 26 Aug 2013 07:27:49 -0700 (PDT) Date: Mon, 26 Aug 2013 16:27:49 +0200 Message-ID: Subject: timeouts with lots of coprocessor puts on single row From: =?ISO-8859-1?Q?Olle_M=E5rtensson?= To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a113495c496bb3104e4da8f53 X-Virus-Checked: Checked by ClamAV on apache.org --001a113495c496bb3104e4da8f53 Content-Type: text/plain; charset=ISO-8859-1 Hi, I have developed a coprocessor that is extending BaseRegionObserver and implements the postPut method. The postPut method scans the columns of the row that the put was issued on and calculates an aggregated based on these values, when this is done a row in another table is updated with the aggregated value. This works out fine until I put some stress on one row, then the threads on the regionserver hosting the table will freeze on flushing the put on the aggregated value. The client application basically do 100 concurrent puts on one row in a tight loop( on the table where the coprocessor is activated ). After that the client sleeps for a while and tries to fetch the aggregated value and here the client freezes and periodically burps out exceptions. It works if I don't run so many put's in parallel. The HBASE environment is pseudo distributed 0.94.11 with one regionserver. I have tried using a connection pool in the coprocessor, bumped up the heapsize of the regionServer and also to up the number of RPC threads for the regionserver but without luck. The pseudo code postPut would be something like this: vals = env.getRegion().get(get).getFamilyMap().values() agg_val = aggregate(vals) agg_table = env.getTable("aggregates") agg_table.setAutoFlush(false) put = new Put() put.add(agg_val) agg_table.put(put) agg_table.flushCommits() agg_table.close() And the real clojure variant is: https://gist.github.com/ollez/d0450930a591912aea5d#file-gistfile1-clj The hbase-site.xml: https://gist.github.com/ollez/d0450930a591912aea5d#file-hbase-site-xml The regionserver stacktrace: https://gist.github.com/ollez/d0450930a591912aea5d#file-regionserver-stacktrace The client exceptions: https://gist.github.com/ollez/d0450930a591912aea5d#file-client-exceptions Thanks // Olle --001a113495c496bb3104e4da8f53--