Message-ID:
 <1359158215.20903.YahooMailNeo@web140605.mail.bf1.yahoo.com>
Date: Fri, 25 Jan 2013 15:56:55 -0800 (PST)
From: lars hofhansl
Subject: Re: Hbase scans taking a lot of time
To: "user@hbase.apache.org", "dev@hbase.apache.org", lars hofhansl
In-Reply-To: <1359151251.68831.YahooMailNeo@web140603.mail.bf1.yahoo.com>

Sorry, I meant scan caching (not batching).


________________________________
From: lars hofhansl
To: "user@hbase.apache.org"; "dev@hbase.apache.org"
Sent: Friday, January 25, 2013 2:00 PM
Subject: Re: Hbase scans taking a lot of time

Enable scan batching in Hive.
You're probably performing 300m RPC requests, i.e. you're mostly measuring network latency.

-- Lars


________________________________
From: Vibhav Mundra
To: user@hbase.apache.org; dev@hbase.apache.org
Sent: Friday, January 25, 2013 1:10 AM
Subject: Hbase scans taking a lot of time

I am facing a very strange problem with HBase.

This is what I did:
a) Created a table using pre-partitioned splits.
b) The column families are compressed with LZO.
c) Using the above configuration I am able to populate 2 million rows per minute into HBase.
d) I have created a table with 300 million-odd rows, which took me roughly 3 hours to populate; the data size is 25GB.

e) But when I query for data, the performance I am getting is very bad.
   Basically this is what I am seeing: high CPU, no disk I/O, and network I/O happening at a rate of 6~7 MB/sec.

Because of this, if I scan the entries of the table using Hive it takes ages.
Basically it is taking around 24 hours to scan the table. Any idea how to debug this?

-Vibhav
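
To put the scan caching suggestion at the top of this thread in numbers, here is a rough back-of-the-envelope sketch. It assumes, as the reply guesses, that each of the 300 million rows currently costs one client-to-server round trip. The function name `scan_rpc_count` is made up for illustration and is not part of HBase or Hive. In the HBase client the relevant knob is `Scan.setCaching(int)` or the `hbase.client.scanner.caching` property; how Hive passes that through depends on the Hive version, so check the documentation for your setup.

```python
def scan_rpc_count(total_rows: int, caching: int) -> int:
    """Round trips needed to fetch total_rows when each RPC returns up to `caching` rows."""
    if caching < 1:
        raise ValueError("caching must be >= 1")
    # Integer ceiling division: a partially filled last batch still costs one RPC.
    return (total_rows + caching - 1) // caching


TOTAL_ROWS = 300_000_000        # row count from the original post
OBSERVED_SECONDS = 24 * 3600    # "around 24 hours" for the full scan

# Assumption: one row per RPC. The implied cost of a single round trip is then:
per_rpc_ms = OBSERVED_SECONDS / scan_rpc_count(TOTAL_ROWS, 1) * 1000
print(f"implied time per round trip: {per_rpc_ms:.3f} ms")

# How the number of round trips shrinks as scan caching grows.
for caching in (1, 100, 1000):
    print(f"caching={caching:>4}: {scan_rpc_count(TOTAL_ROWS, caching):>11,} RPCs")
```

A fraction of a millisecond per round trip fits a scan dominated by per-request overhead rather than disk, which matches the "no disk I/O" observation in the original post. Larger caching values cut the number of round trips proportionally, at the cost of more client memory per fetch. Actual wall-clock gains will not scale exactly linearly, since each RPC then carries more data.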