Return-Path: Delivered-To: apmail-hbase-user-archive@www.apache.org Received: (qmail 74622 invoked from network); 11 Mar 2011 19:14:18 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 11 Mar 2011 19:14:18 -0000 Received: (qmail 61959 invoked by uid 500); 11 Mar 2011 19:14:17 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 61921 invoked by uid 500); 11 Mar 2011 19:14:17 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 61913 invoked by uid 99); 11 Mar 2011 19:14:17 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Mar 2011 19:14:17 +0000 X-ASF-Spam-Status: No, hits=3.6 required=5.0 tests=FREEMAIL_FROM,FS_REPLICA,RCVD_IN_DNSWL_NONE,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [66.94.238.131] (HELO web130104.mail.mud.yahoo.com) (66.94.238.131) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 11 Mar 2011 19:14:10 +0000 Received: (qmail 64999 invoked by uid 60001); 11 Mar 2011 19:13:48 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1299870828; bh=I1+0YkQRfa5C82WlA4UqpPH4K3ccTzNzYUhYwOIzhi0=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:References:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=zmtd7UfBdjspvQ0ecP7+t7zptJiABtdQhaKPcCzetfG7YaK62mevoR17WA5893hdiOFKQw2ngbuBKFLBLQzQGsiqD/ohizkdU6IPFP+51W3MGAKboAzWmeJDbTOgf1XNEHPBOegrEHJq2uJF2LuPI1UOwRSFBZNeVBFEojwZlyQ= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:References:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=J2FkGG7tBbfy41bXdcUWsByaRwu3fQrDkQnt/GqcKBV3aeWHs0RsvTaqj4fUMowjJafW31UtOaX5lGlI0wXURqhtLzJZ5myM13B3uZAaVkdbDvn3dBFgWezvNwVBNQ+OIrkWs10OjkD8/qPbXSQbG98bKo6ca5pO2rsPGpEzFX4=; Message-ID: <680621.26646.qm@web130104.mail.mud.yahoo.com> X-YMail-OSG: f4p7gF4VM1kyiMyC96419YF5MGKvTlYaH21YoImIZ7CbgRJ .YLCWLXEXSwghvDqVNpRv8UGYrGfuPRgEjz7FvRqxyoLtmqvzyJ4uSBhvcom Red2aMhEBLwaH7FR.WNPWGo.gBwvrlIgbQ3fAH6keR9vPrNZaQiet5H4Thas OP4O9nPfWNjd7IGpSKQcQcITGT.qJrA29mx4xd2kG5.u9Rfl0WTHCA822wXJ 10qAEOiYlqf76ImXGQTvgZFZIPS99o98JDK_Vh1c_aQVp9dTju4al3_HCHsi UDIshuiwC0Qe_yLruRNzR3lu0q4Nnt.30c8C8jOCsr4zA71yl.LkOayGVD2i 8d9WoDZ1np_d9bl6223nX4TSxHvsFzTJAJnjpoKj6Q80unqv1eSrQKHiJQoF yAIzD.8tTGrQ- Received: from [184.75.0.186] by web130104.mail.mud.yahoo.com via HTTP; Fri, 11 Mar 2011 11:13:48 PST X-Mailer: YahooMailRC/559 YahooMailWebService/0.8.109.295617 References: <945626.31022.qm@web65511.mail.ac4.yahoo.com> Date: Fri, 11 Mar 2011 11:13:48 -0800 (PST) From: Otis Gospodnetic Subject: Re: HBase => replication => Hive To: user@hbase.apache.org In-Reply-To: <945626.31022.qm@web65511.mail.ac4.yahoo.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Checked: Checked by ClamAV on apache.org Hi, ----- Original Message ---- > From: Andrew Purtell > > Pardon, I'm not as familiar with this area as I should, but > > > apparently Hive queries run about x5 > > slower than queries that go against normal Hive tables. > > Is this not a reasonable place to start? Why is this? Reasonable? I don't know. :) That's really the first thing I was hoping to find out. J-Ds reaction makes it sound like this is not unreasonable. > > I was wondering if people think it would be possible to > > implement HBase=>Hive replication? > > This strikes me as non trivial. If doing this level of effort, why not look >into the Hive/HBase integration? Maybe there is something HBase can do to make >it faster? At this point I don't know how trivial or non-trivial it is yet. But I thought that if John Sichi, who strikes me as a pretty smart fellow, says he's seeing x5 performance loss and he's the one who worked on the integration, getting from 5 to 4 or lower may be non-trivial. HBase => Hive is terra incognita so, who knows, maybe it's easy to do. :) Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ > Best regards, > > - Andy > > Problems worthy of attack prove their worth by hitting back. > - Piet Hein (via Tom White) > > > --- On Thu, 3/10/11, Otis Gospodnetic wrote: > > > From: Otis Gospodnetic > > Subject: HBase => replication => Hive > > To: user@hbase.apache.org > > Date: Thursday, March 10, 2011, 10:43 PM > > Hi, > > > > Since HBase has a mechanism to replicate edit logs to > > another HBase cluster, I was wondering if people think it > > would be possible to implement HBase=>Hive > > replication? (and really make the destination pluggable > > later on) > > > > I'm asking because while one can integrate Hive and HBase > > by creating external tables in Hive that actually point to > > tables in HBase, apparently Hive queries run about x5 > > slower than queries that go against normal Hive tables. > > > > And because all HBase export options are for 1 table at a > > time and not point in time snapshots of the whole table, > > exporting data from HBase and importing into Hive doesn't > > sound like a viable option. > > > > Thanks, > > Otis > > ---- > > Sematext :: http://sematext.com/ :: Solr - Lucene - Hadoop > > Hadoop ecosystem search :: http://search-hadoop.com/ > > > > > > > >