Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 01EAFE2EA for ; Fri, 11 Jan 2013 15:28:56 +0000 (UTC) Received: (qmail 89705 invoked by uid 500); 11 Jan 2013 15:28:53 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 89649 invoked by uid 500); 11 Jan 2013 15:28:53 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 89641 invoked by uid 99); 11 Jan 2013 15:28:53 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Jan 2013 15:28:53 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dontariq@gmail.com designates 209.85.220.174 as permitted sender) Received: from [209.85.220.174] (HELO mail-vc0-f174.google.com) (209.85.220.174) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Jan 2013 15:28:48 +0000 Received: by mail-vc0-f174.google.com with SMTP id d16so1516459vcd.19 for ; Fri, 11 Jan 2013 07:28:27 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=lbV7IGJcxUSxT+dDIOKqMpj6Kh/dZjHjbag8fH9yh2w=; b=PEspRWhR7lw0SRHRFl1j9VfGxHuzSbcA3D2VNvaBkjgUYgl21OAgD1bnX22QphhLdB FjjOJ7i5JrZMMFRYGqe37B1e4ygl4ipwr3JyikA3sWZfhlRIr0n0vWtkTb08/fDyF0b/ XmSW3NA7hhmS4De0gwNSzn0lPpKLjDNpNIdw6gv4a5zkMwWTEeHuQAqQm20jxpDNV6ek sNP3i+j4rbOxKzaefezfFDAW1kNBPQTP/ycTGncm5SB7mqUtXo++KMTY/7ht57bwvUw3 1WGKta1lnn/QqcuEhHzgi1ZqaWTQOz5usaSAKXbhdt9x5lz0PsCHR53tA5GFpthYuu+Q JMvA== Received: by 10.220.228.138 with SMTP id je10mr52414305vcb.67.1357918107836; Fri, 11 Jan 2013 07:28:27 -0800 (PST) MIME-Version: 1.0 Received: by 10.58.34.16 with HTTP; Fri, 11 Jan 2013 07:27:47 -0800 (PST) In-Reply-To: References: <1357380684848-4036184.post@n3.nabble.com> <3702994083872207037@unknownmsgid> <1357876263.21531.YahooMailNeo@web140603.mail.bf1.yahoo.com> From: Mohammad Tariq Date: Fri, 11 Jan 2013 20:57:47 +0530 Message-ID: Subject: Re: Storing images in Hbase To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=14dae9cdc0f373a20104d304f222 X-Virus-Checked: Checked by ClamAV on apache.org --14dae9cdc0f373a20104d304f222 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable IMHO, if the image files are not too huge, Hbase can efficiently serve the purpose. You can store some additional info along with the file depending upon your search criteria to make the search faster. Say if you want to fetch images by the type, you can store images in one column and its extension in another column(jpg, tiff etc). BTW, what exactly is the problem which you are facing. You have written "But I still cant do it"? Warm Regards, Tariq https://mtariq.jux.com/ On Fri, Jan 11, 2013 at 8:30 PM, Michael Segel w= rote: > That's a viable option. > HDFS reads are faster than HBase, but it would require first hitting the > index in HBase which points to the file and then fetching the file. > It could be faster... we found storing binary data in a sequence file and > indexed on HBase to be faster than HBase, however, YMMV and HBase has bee= n > improved since we did that project.... > > > On Jan 10, 2013, at 10:56 PM, shashwat shriparv > wrote: > > > Hi Kavish, > > > > i have a better idea for you copy your image files to a single file on > > hdfs, and if new image comes append it to the existing image, and keep > and > > update the metadata and the offset to the HBase. Because if you put > bigger > > image in hbase it wil lead to some issue. > > > > > > > > =E2=88=9E > > Shashwat Shriparv > > > > > > > > On Fri, Jan 11, 2013 at 9:21 AM, lars hofhansl wrote= : > > > >> Interesting. That's close to a PB if my math is correct. > >> Is there a write up about this somewhere? Something that we could link > >> from the HBase homepage? > >> > >> -- Lars > >> > >> > >> ----- Original Message ----- > >> From: Jack Levin > >> To: user@hbase.apache.org > >> Cc: Andrew Purtell > >> Sent: Thursday, January 10, 2013 9:24 AM > >> Subject: Re: Storing images in Hbase > >> > >> We stored about 1 billion images into hbase with file size up to 10MB. > >> Its been running for close to 2 years without issues and serves > >> delivery of images for Yfrog and ImageShack. If you have any > >> questions about the setup, I would be glad to answer them. > >> > >> -Jack > >> > >> On Sun, Jan 6, 2013 at 1:09 PM, Mohit Anchlia > >> wrote: > >>> I have done extensive testing and have found that blobs don't belong = in > >> the > >>> databases but are rather best left out on the file system. Andrew > >> outlined > >>> issues that you'll face and not to mention IO issues when compaction > >> occurs > >>> over large files. > >>> > >>> On Sun, Jan 6, 2013 at 12:52 PM, Andrew Purtell > >> wrote: > >>> > >>>> I meant this to say "a few really large values" > >>>> > >>>> On Sun, Jan 6, 2013 at 12:49 PM, Andrew Purtell > >>>> wrote: > >>>> > >>>>> Consider if the split threshold is 2 GB but your one row contains 1= 0 > >> GB > >>>> as > >>>>> really large value. > >>>> > >>>> > >>>> > >>>> > >>>> -- > >>>> Best regards, > >>>> > >>>> - Andy > >>>> > >>>> Problems worthy of attack prove their worth by hitting back. - Piet > Hein > >>>> (via Tom White) > >>>> > >> > >> > > --14dae9cdc0f373a20104d304f222--