Subject: Re: Hbase row ingestion ..
From: Michael Segel
Date: Thu, 30 Apr 2015 16:20:58 -0500
To: user@hbase.apache.org

Exactly! So you don't need to know whether your table is bucketed or not.
You just put() or get()/scan() like you would on any other table.

> On Apr 30, 2015, at 3:00 PM, Andrew Mains wrote:
>
> Thanks all again for the replies--this is a very interesting discussion :).
>
> @Michael HBASE-12853 is definitely an interesting proposition for our (Upsight's) use case--we've done a moderate amount of work to make our reads over the bucketed table efficient using Hive. In particular, we added support for predicate pushdown to multiple scans, which allows us to read only a specific range within each bucket--see HIVE-7805. If I understand correctly, with HBASE-12853 we could make that pushdown work transparently--that is, the client code could just push down a single scan, which would then be fanned out to each bucket. It would certainly make our code somewhat cleaner (we currently create a scan with our predicate for each bucket, and then push all of those to MultiTableInputFormat).
>
> Best,
>
> Andrew
>
>
> On 4/30/15 12:36 PM, Michael Segel wrote:
>> The downside here is that you will lose your ability to perform range scans

The opinions expressed here are mine, while they may reflect a cognitive thought, that is purely accidental.
Use at your own risk.
Michael Segel
michael_segel (AT) hotmail.com
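
For readers following the thread, here is a rough sketch of the idea discussed above: one logical range scan fanned out into one scan per bucket, so the caller never sees the bucket prefix. It is an illustration only, not HBASE-12853 or the HIVE-7805 code. The table is a plain Python dict standing in for HBase, and NUM_BUCKETS, the "NN|" prefix format, and all function names are assumptions made up for this example.

```python
import hashlib

NUM_BUCKETS = 4  # hypothetical bucket count, chosen only for this sketch


def bucket_of(key: str, num_buckets: int = NUM_BUCKETS) -> int:
    """Pick a bucket from a hash of the logical row key (assumed scheme)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return digest[0] % num_buckets


def bucketed_key(key: str, num_buckets: int = NUM_BUCKETS) -> str:
    """Prefix the logical key with a fixed-width bucket id, e.g. '02|row-007'."""
    return f"{bucket_of(key, num_buckets):02d}|{key}"


def fan_out(start: str, stop: str, num_buckets: int = NUM_BUCKETS):
    """Turn one logical [start, stop) range into one range per bucket."""
    return [(f"{b:02d}|{start}", f"{b:02d}|{stop}") for b in range(num_buckets)]


def scan(table: dict, start: str, stop: str, num_buckets: int = NUM_BUCKETS):
    """Run the per-bucket ranges and merge results back into logical key order.

    `table` is a dict of stored (bucketed) key -> value that stands in for
    an HBase table; a real client would issue one Scan per range instead.
    """
    rows = []
    for lo, hi in fan_out(start, stop, num_buckets):
        for stored_key in sorted(table):
            if lo <= stored_key < hi:
                logical_key = stored_key.split("|", 1)[1]  # strip the prefix
                rows.append((logical_key, table[stored_key]))
    return sorted(rows)


if __name__ == "__main__":
    table = {bucketed_key(f"row-{i:03d}"): i for i in range(10)}
    for key, value in scan(table, "row-003", "row-007"):
        print(key, value)
```

The caller passes a single logical range and gets rows back in logical key order; the per-bucket ranges are what a client would otherwise have to build by hand, one scan per bucket.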