Subject: Re: Hbase scans taking a lot of time
From: Vibhav Mundra
To: user@hbase.apache.org
Date: Fri, 25 Jan 2013 23:29:22 +0530

The number of column families I have is 13, which I guess is okay?

-Vibhav

On Fri, Jan 25, 2013 at 11:01 PM, Luke Lu wrote:

> You'll have this problem if you have a large number of column families
> being scanned/populated at the same time. Make sure the data you
> scan/populate frequently is in the same column family (you can have many
> columns in a column family). Unlike BigTable/Hypertable, which have the
> concept of locality/access groups, HBase always stores column families in
> separate files, essentially making a column family not only a logical
> grouping mechanism but also a physical locality group.
>
> On Fri, Jan 25, 2013 at 1:10 AM, Vibhav Mundra wrote:
>
> > I am facing a very strange problem with HBase.
> >
> > This is what I did:
> > a) Created a table using pre-partitioned splits.
> > b) The column families are compressed with LZO.
> > c) Using the above configuration I am able to populate 2 million rows
> > per minute into HBase.
> > d) I have created a table with 300 million odd rows, which took me
> > roughly 3 hours to populate, and the data size is 25GB.
> >
> > e) But when I query for data, the performance I am getting is very bad.
> > Basically this is what I am seeing: high CPU, no disk I/O, and network
> > I/O happening at a rate of 6~7 MB/sec.
> >
> > Because of this, if I scan the entries of the table using Hive, it is
> > taking ages.
> > Basically it is taking around 24 hours to scan the table. Any idea how
> > to debug this?
> >
> > -Vibhav
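
As a first debugging step, the figures quoted in the thread can be turned into rates so the load and scan paths are compared on the same scale. The sketch below is only back-of-envelope arithmetic on the numbers in the message. The variable names are illustrative, the 6.5 MB/s value is an assumed midpoint of "6~7MB", and it is an assumption that the 25GB figure is the on-disk (LZO-compressed) size, so the bytes actually sent over the network may be larger.

```python
# Back-of-envelope rates from the figures quoted in the thread.
# All names here are illustrative; nothing below talks to HBase.
GB = 1024 ** 3
MB = 1024 ** 2

rows = 300_000_000             # "300 million odd rows"
data_bytes = 25 * GB           # "25GB" (assumed to be the compressed on-disk size)
load_rows_per_min = 2_000_000  # "2 million row per min"
scan_hours = 24                # "around 24 hours to scan the table"
network_mb_per_s = 6.5         # assumed midpoint of "6~7MB secs"


def scan_rates(rows, data_bytes, hours):
    """Return (rows per second, MB per second) for a scan lasting `hours`."""
    seconds = hours * 3600
    return rows / seconds, data_bytes / seconds / MB


scan_rows_per_s, scan_mb_per_s = scan_rates(rows, data_bytes, scan_hours)

# Load-side rates implied by the stated ingest speed.
load_rows_per_s = load_rows_per_min / 60
load_hours = rows / load_rows_per_min / 60

# Time needed to move 25GB at the observed network rate.
hours_at_network_rate = data_bytes / (network_mb_per_s * MB) / 3600

print(f"scan: {scan_rows_per_s:,.0f} rows/s, {scan_mb_per_s:.2f} MB/s of stored data")
print(f"load: {load_rows_per_s:,.0f} rows/s, about {load_hours:.1f} h for the full table")
print(f"25GB at {network_mb_per_s} MB/s would take about {hours_at_network_rate:.1f} h")
```

Under these assumptions the scan reads rows roughly ten times slower than the load wrote them, and the observed network rate alone would move 25GB in about an hour. That gap suggests the time is going somewhere other than raw byte transfer, which is consistent with the high CPU and idle disk reported, though the arithmetic by itself does not identify the cause.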