Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8734619C1A for ; Sun, 17 Apr 2016 01:40:24 +0000 (UTC) Received: (qmail 38671 invoked by uid 500); 17 Apr 2016 01:40:22 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 38603 invoked by uid 500); 17 Apr 2016 01:40:22 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 38591 invoked by uid 99); 17 Apr 2016 01:40:22 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 17 Apr 2016 01:40:22 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 90534C0DD2 for ; Sun, 17 Apr 2016 01:40:21 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.879 X-Spam-Level: * X-Spam-Status: No, score=1.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id kMDRv7m604S5 for ; Sun, 17 Apr 2016 01:40:20 +0000 (UTC) Received: from mail-yw0-f169.google.com (mail-yw0-f169.google.com [209.85.161.169]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id 55C435F472 for ; Sun, 17 Apr 2016 01:40:19 +0000 (UTC) Received: by mail-yw0-f169.google.com with SMTP id d68so171530400ywe.1 for ; Sat, 16 Apr 2016 18:40:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to; bh=kAmNA4wW3mekdZzytmKt4cu+vh2ko5Yv2Rz1vlwgu4w=; b=sDvSzxPdpXLUL7kqngpi3oan0RuKs5+NYrY5Rc08/arOVmKPlmeEjJKgD2HCkUg09c ePPavW7jc24dbwAX3B+OppFkAoVDdGbfyH9DepLT6T2VBr/uy9I3CITao20fW6mFFD0/ 3IHCQyZLOTabFcKO/IQJRX/HCq0lewOfSSLLirGGTh84vU0DSR2n2toHqPsAgA3rRIlF OywGL235qEwa49R8AOWXXvInBu+780ADIGO7BZM5f+SKdIcL3bqLNfTdUNfHByrdELtv BbW55LX7qwZllMjMjnA7n9rQa5fsWTkQk+eHIfy/2LpsM1argg3tRKmQ2U9863awploE jEbQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to; bh=kAmNA4wW3mekdZzytmKt4cu+vh2ko5Yv2Rz1vlwgu4w=; b=byZ3htMnAV/xWBzuCdUNPS4HvhE9sQCCSJfZ4EBJP8Mausm0V2YGWZrwXMim3LASYe EsD6cP9YEfAvTXjXnHYupPU6csK9RpbMqhgKlVHz+RZgSycMLXLSZA5zRJOH2WPGs0/C jv4GytH7IyUMiqnNmsQ9rJaSB0pXHyQStn+p/gyl2H5/sRu4ORNUCDRivwv6HRjTN6CU ue7TKQja7u78h6yY5h/hwVMovWxgoJCUa4XaVN0/gt0quQJ5y4H8klCoD9CA4tS0vfOI MRpJOSMrK4KTA3IjKZV1pKnGlLjPv2AADKbM4bRC2EA224tuOxgL19LkI10OZPhuvSw7 iV9g== X-Gm-Message-State: AOPr4FW0lIDgriLjM9qvMqyL+pDHrB2s8Y8V/kkh+kWjp0S5Ru9ejMYWj2BDUjaTP9M7wB/naCd+AGHaXkJAZw== MIME-Version: 1.0 X-Received: by 10.129.4.75 with SMTP id 72mr17460561ywe.184.1460857218298; Sat, 16 Apr 2016 18:40:18 -0700 (PDT) Received: by 10.37.101.195 with HTTP; Sat, 16 Apr 2016 18:40:18 -0700 (PDT) In-Reply-To: References: Date: Sat, 16 Apr 2016 18:40:18 -0700 Message-ID: Subject: Re: To Store Large Number of Video and Image files From: Ted Yu To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=001a113f5022907c890530a45377 --001a113f5022907c890530a45377 Content-Type: text/plain; charset=UTF-8 Have you taken a look at HBASE-11339 (HBase MOB) ? Note: this feature does not handle 10GB objects well. Consider store GB image on hdfs. Cheers On Sat, Apr 16, 2016 at 6:21 PM, Ascot Moss wrote: > Hi, > > I have a project that needs to store large number of image and video files, > the file size varies from 10MB to 10GB, the initial number of files will be > 0.1 billion and would grow over 1 billion, what will be the practical > recommendations to store and view these files? > > > > #1 One cluster, store the HDFS URL in HBase and store the actual file in > HDFS? (block_size as 128MB and replication factor as 3) > > > #2 One cluster, Store small files in HBase directly and use #1 for large > files? (block_size as 128MB and replication factor as 3) > > > #3 Multiple Hadoop/HBase clusters, each with different block_size settings? > > > e.g. cluster 1 (small): block_size as 128MB and replication factor as > 3, store all files in HBase if their file size is smaller 128MB > > cluster 2 (large): bigger block_size, say 4GB, replication > factor as 3, store the HDFS URL in HBase and store the actual file in HDFS > > > > #4 Use Hadoop Federation for large number of files? > > > About Fault Tolerance, need to consider four types of failures: driver, > host, rack, and datacenter failures. > > > Regards > --001a113f5022907c890530a45377--