Return-Path: Delivered-To: apmail-hbase-user-archive@www.apache.org Received: (qmail 20789 invoked from network); 25 Aug 2010 10:54:27 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 25 Aug 2010 10:54:27 -0000 Received: (qmail 9769 invoked by uid 500); 25 Aug 2010 10:54:26 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 9293 invoked by uid 500); 25 Aug 2010 10:54:23 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 9284 invoked by uid 99); 25 Aug 2010 10:54:21 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Aug 2010 10:54:21 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of michael_segel@hotmail.com designates 65.55.34.88 as permitted sender) Received: from [65.55.34.88] (HELO col0-omc2-s14.col0.hotmail.com) (65.55.34.88) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Aug 2010 10:54:15 +0000 Received: from COL117-W12 ([65.55.34.71]) by col0-omc2-s14.col0.hotmail.com with Microsoft SMTPSVC(6.0.3790.4675); Wed, 25 Aug 2010 03:53:54 -0700 Message-ID: Content-Type: multipart/alternative; boundary="_5f7a1d9f-fdd2-45d0-a062-b0769b1b0999_" X-Originating-IP: [173.15.87.33] From: Michael Segel To: Subject: RE: Best way to get multiple non-sequential rows Date: Wed, 25 Aug 2010 05:53:54 -0500 Importance: Normal In-Reply-To: <5A76F6CE309AD049AAF9A039A392428207329F8B@sc-mbx03.TheFacebook.com> References: ,<4C73F4F0.1050208@gmail.com>,,,<5A76F6CE309AD049AAF9A039A392428207329F8B@sc-mbx03.TheFacebook.com> MIME-Version: 1.0 X-OriginalArrivalTime: 25 Aug 2010 10:53:54.0890 (UTC) FILETIME=[CFBC1AA0:01CB4443] --_5f7a1d9f-fdd2-45d0-a062-b0769b1b0999_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Jonathan=2C Ok=2C that makes some sense... So you would have some method mget(fetchKeyList=2Cnumthreads) returning res= ultList[]. So what's thread safe these days?=20 -Mike > From: jgray@facebook.com > To: user@hbase.apache.org > Subject: RE: Best way to get multiple non-sequential rows > Date: Wed=2C 25 Aug 2010 03:52:38 +0000 >=20 > Michael=2C >=20 > MultiGet is about performing a set of Get operations in parallel from the= client. So it buys you potential performance benefits from the concurrenc= y/distribution of your operations. >=20 > Roughly=2C you would bucket the gets according to their region and region= server. Then spawn a thread for each RS and fire off the Gets concurrently= . >=20 > If I have 100 Gets to perform on a random set of keys=2C assuming each ge= t takes 10ms=2C doing them sequentially will take 1 second. Other factors = and RS concurrency aside=2C with MultiGet on a 10 node cluster=2C the total= time would be reduced to 100ms. With 50 nodes=2C 20ms. >=20 > JG >=20 >=20 > > -----Original Message----- > > From: Michael Segel [mailto:michael_segel@hotmail.com] > > Sent: Tuesday=2C August 24=2C 2010 7:53 PM > > To: user@hbase.apache.org > > Subject: RE: Best way to get multiple non-sequential rows > >=20 > >=20 > > Igor=2C > >=20 > > What does this really buy you? > >=20 > > I'm trying to figure out a use case that would show a benefit from just > > fetching the rows individually. Since the rows are not contiguous=2C th= e > > odds of the next row you want being in cache are going to slight to > > most likely not. =3B-) > >=20 > > Can you give a use case where having a 'multi-get' will make life > > easier? > >=20 > > Thx > >=20 > > -Mike > >=20 > >=20 > > > Date: Wed=2C 25 Aug 2010 07:17:13 +0600 > > > Subject: Re: Best way to get multiple non-sequential rows > > > From: imyousuf@gmail.com > > > To: user@hbase.apache.org > > > > > > Thanks Igor=2C I will have a look at it. > > > > > > /Imran > > > > > > On Tue=2C Aug 24=2C 2010 at 10:36 PM=2C Igor Ranitovic > > wrote: > > > > Take a look at > > > > https://issues.apache.org/jira/browse/HBASE-1845 > > > > > > > > As an HBase user=2C multi gets is something that I have been lookin= g > > forward > > > > to for some time now. If there is enough interest it would be great > > if this > > > > becomes part of 0.90. > > > > > > > > Take care=2C > > > > i. > > > > > > > > Imran M Yousuf wrote: > > > >> > > > >> Hi=2C > > > >> > > > >> I am using the HBase client API to interact with HBase. I have > > noticed > > > >> that HTableInterface has operations such as put(List)=2C > > > >> delete(List)=2C but there is no similar method for Get. > > Using > > > >> scan it is possible to load a range of rows=2C i.e. sequential row= s. > > My > > > >> question is - > > > >> how would it be most efficient to load N non-sequential rows? > > > >> > > > >> Currently I am using get(Get) method N times. > > > >> > > > > > > > > > > > > > > > > > > > > -- > > > Imran M Yousuf > > > Blog: http://imyousuf-tech.blogs.smartitengineering.com/ > > > Mobile: +880-1711402557 > >=20 = --_5f7a1d9f-fdd2-45d0-a062-b0769b1b0999_--