Return-Path: Delivered-To: apmail-hbase-user-archive@www.apache.org Received: (qmail 1763 invoked from network); 26 Mar 2011 17:01:36 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Mar 2011 17:01:36 -0000 Received: (qmail 15773 invoked by uid 500); 26 Mar 2011 17:01:34 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 15738 invoked by uid 500); 26 Mar 2011 17:01:34 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 15730 invoked by uid 99); 26 Mar 2011 17:01:34 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 26 Mar 2011 17:01:34 +0000 X-ASF-Spam-Status: No, hits=-0.1 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of hsreekumar@clickable.com designates 74.125.149.77 as permitted sender) Received: from [74.125.149.77] (HELO na3sys009aog106.obsmtp.com) (74.125.149.77) by apache.org (qpsmtpd/0.29) with SMTP; Sat, 26 Mar 2011 17:01:26 +0000 Received: from source ([209.85.160.180]) (using TLSv1) by na3sys009aob106.postini.com ([74.125.148.12]) with SMTP ID DSNKTY4bzzB+4NeaDP3Zvs3EjGcwkCs8sZI3@postini.com; Sat, 26 Mar 2011 10:01:05 PDT Received: by mail-gy0-f180.google.com with SMTP id 2so760200gyf.11 for ; Sat, 26 Mar 2011 10:01:03 -0700 (PDT) MIME-Version: 1.0 Received: by 10.150.233.2 with SMTP id f2mr2330285ybh.97.1301158863438; Sat, 26 Mar 2011 10:01:03 -0700 (PDT) Received: by 10.150.198.10 with HTTP; Sat, 26 Mar 2011 10:01:03 -0700 (PDT) Date: Sat, 26 Mar 2011 22:31:03 +0530 Message-ID: Subject: HBase read latency From: Hari Sreekumar To: user Content-Type: multipart/alternative; boundary=000e0cd56208da037e049f65a7a2 X-Virus-Checked: Checked by ClamAV on apache.org --000e0cd56208da037e049f65a7a2 Content-Type: text/plain; charset=ISO-8859-1 Hi guys, On what factors does HBase read latency primarily depend? What would be the approx theoretical limit for read latency in v0.90.1 on a cluster of 7 nodes (16 core/16 GB RAM on 5 machines and 36 GB on the other two)? I have an application where I generate around 1000 rows/s to be input into HBase. Then I have to read this data and process it at regular intervals. Write speed is not a problem as the cluster seems to be able to write at the reqd. rate. But while processing this data also, I would need a read speed of at least 1000 rows/s since I need to keep the processing speed at least equal to the data generation speed. So far, I am getting around 200-300 rows/s only it seems. I have LZO compression on the tables and I haven't tried in-memory yet as my RAM usage is too high already while running jobs. Is it possible to achieve this read speed, and what can I do to improve it? How far can adding more nodes/more RAM help? Please let me know if the scope is too huge to answer this question and if you need more details. Thanks, Hari --000e0cd56208da037e049f65a7a2--