Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DB968112DB for ; Mon, 22 Sep 2014 16:31:37 +0000 (UTC) Received: (qmail 61401 invoked by uid 500); 22 Sep 2014 16:31:36 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 61332 invoked by uid 500); 22 Sep 2014 16:31:36 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 61318 invoked by uid 99); 22 Sep 2014 16:31:35 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Sep 2014 16:31:35 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.220.172] (HELO mail-vc0-f172.google.com) (209.85.220.172) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Sep 2014 16:31:08 +0000 Received: by mail-vc0-f172.google.com with SMTP id hy10so4368650vcb.31 for ; Mon, 22 Sep 2014 09:31:07 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; bh=EjdE0D8QMyNQhZqV3R2FcyzofWn11ew/46DFgpfiz70=; b=Vp5TuBRXGRaTL3noTod9FVUCRmEMD62gz/LulLXhsLhJyVjd2Bfn/xNR0Yuc1tPtAQ y1No9utfGeveXQYddH9LG8ThqHGYxUcC3sasARgEgzIzAum8fYgW1oMZ3y0V3zfHPm6O Uyxc0ywquPD7ZkEsgK6BOQbtLm8e6ijskcxzv1+F5ZsVrWS/Be6FunY3J/4POEjlbDqn eehfdFqctKlw+k3C8+JalJIKyMT7GQIGK8mpfeYjJVUZYnrbb/EfQVGjzgQXdjNcFMDB teBYkVJNPLX76O8iT6Fvb3kdHNCzjwJFiip5MTKUbZGD+kdSjWFiOe45ejVd2fEOhBdj AHwQ== X-Gm-Message-State: ALoCoQkGSln7/Jy1kFNXOQeWbL9skT2yW0xK3jHCCJ1tEO08LCpQtkoS64QW6hzx9BA0N0J9e9JS X-Received: by 10.52.73.170 with SMTP id m10mr17160892vdv.20.1411403467674; Mon, 22 Sep 2014 09:31:07 -0700 (PDT) MIME-Version: 1.0 Received: by 10.52.100.227 with HTTP; Mon, 22 Sep 2014 09:30:47 -0700 (PDT) In-Reply-To: References: From: Jean-Marc Spaggiari Date: Mon, 22 Sep 2014 12:30:47 -0400 Message-ID: Subject: Re: HBase Bulk Lookup To: user Content-Type: multipart/alternative; boundary=bcaec5016643536d5e0503a9faf3 X-Virus-Checked: Checked by ClamAV on apache.org --bcaec5016643536d5e0503a9faf3 Content-Type: text/plain; charset=UTF-8 Hi Bin, 100M rows 1 byte is about 100MB. 100M rows 1KB is about 100GB. What is your record size and what is your SLA? Do you expected 100GB to be transfered in few seconds? How to you query your data? A single get? All of it? You might want to give way more details of your usecases if you want more accurate advices. HBase is VERY good for random writes and random reads. It call also scale (almost) to the infinite. 100MB for HBase is pretty small. So far from what I can understand on your usecase, HBAse bulk load + HBase get/multi get is what you need, but as I said above, need more details. JM 2014-09-22 12:24 GMT-04:00 Bin Wang : > Hi Ted, > > I have not dive into the programming part yet... I am still at the POC and > pick up the right tool stage. Based on your experience, do you think the > get(List gets) will return the result from 100M level in an > interactive time? say a few seconds? > > If that is the case, I will start working on a prototype. > > Bin > > On Mon, Sep 22, 2014 at 10:00 AM, Ted Yu wrote: > > > bq. upload a list of product ids > > > > Have you looked at the following API in HTable ? > > > > public Result[] get(List gets) throws IOException { > > > > Cheers > > > > On Mon, Sep 22, 2014 at 8:14 AM, Bin Wang wrote: > > > > > Hi there, > > > > > > I have a use case that I need to do bulk look up in a table of size 100 > > > million key value pairs. Where key is the unique ID (product id), and > the > > > value is inventory history (time series) for that particular part. > > > > > > I want user upload a list of product ids, and I am wondering if HBase > is > > > the right tool to return the corresponding value in an interactive > speed? > > > > > > If not, I heard of Solr/ElasticSearch, mongo, redis, Cassandra also, > and > > I > > > am wondering which tool is the best fit in my use case. > > > > > > Thanks for any suggestion. > > > > > > Bin > > > > > > --bcaec5016643536d5e0503a9faf3--