Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 50977200B3E for ; Wed, 7 Sep 2016 16:19:30 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 4F32F160AC1; Wed, 7 Sep 2016 14:19:30 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 945B4160AA3 for ; Wed, 7 Sep 2016 16:19:29 +0200 (CEST) Received: (qmail 58287 invoked by uid 500); 7 Sep 2016 14:19:28 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 58267 invoked by uid 99); 7 Sep 2016 14:19:27 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Sep 2016 14:19:27 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 83E221A023D for ; Wed, 7 Sep 2016 14:19:27 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.179 X-Spam-Level: * X-Spam-Status: No, score=1.179 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id gvs94LgNtRFI for ; Wed, 7 Sep 2016 14:19:26 +0000 (UTC) Received: from mail-ua0-f177.google.com (mail-ua0-f177.google.com [209.85.217.177]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id B872560D24 for ; Wed, 7 Sep 2016 14:19:25 +0000 (UTC) Received: by mail-ua0-f177.google.com with SMTP id 31so14123298uao.0 for ; Wed, 07 Sep 2016 07:19:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=ThzL9GKbSg5/FYc3G3RQhJb0AnScKaiQ6qaWh11harQ=; b=iim3NBf/9kDG0l082D2Ex3UvsTHTuq1QC6HS9GLVxJn3TvN/2rucXSawHho6pxwp4q 9aFArh5GMjZhtCKQ73X91u5n/Q4LhXyl49hsbM2cLK+hSEVI0G+HmhiwriI2Kgj/ufW7 GxkH+9CWLUlKbRjYmuSzYTK6dAS/knyT3Uzspj/Hqq+EvoFozPNybHhzsMyhzSTChzLm zqmSfvHFRhOL0uSE4VQkwqqbYSbJTgEQWP+NIb2tpFEpQtwF8ztjNs8rphcOStVr8zHq eJy+VkozTDJ1wRzCjsIDEQfVxUvE1ajnWCiQ6BIr0CydSVkESMqNkN40MgBqQTtpC9+d 6yTA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=ThzL9GKbSg5/FYc3G3RQhJb0AnScKaiQ6qaWh11harQ=; b=HFpKRKWx4/uHqPTH6fOsK//uds+rEV1Nbu4g4svqPSpaFkycsgmXyZL+MdPJ57GKfm aI6k8WXv93OJOo7pjMCphleQy+zL7g6AhQ4lqE0PbV4irD/G84rKdsiaJckdqpksA3P0 D8tGkTOcLSbQwnuENUIQHsErn0QKStojvSxks9b4yb7QZQzKynWLo+qz+zKcI0DzzLHH is/JnJTwhOQFlU7a5fjMHbDBaMEkX4Ffy5JS2mCv88y7GT0Vcn6X/X+jgaVcUXL+fhxz b/EoKmBhr7mugTG74W/78i2p2dUA+wQ5WRIiOrPP5MO5LOTbHijLepPpz7kj87WA0V+4 +WuQ== X-Gm-Message-State: AE9vXwPqY7qFmugZWRcnQz6rTRAN/wi45q7PV9j/eX5Mp2RmyZNoDq9r7G4tRDPWpoCojz68/1VOC4h/sdJNBg== X-Received: by 10.176.1.97 with SMTP id 88mr29453029uak.147.1473257958731; Wed, 07 Sep 2016 07:19:18 -0700 (PDT) MIME-Version: 1.0 Received: by 10.103.72.135 with HTTP; Wed, 7 Sep 2016 07:19:18 -0700 (PDT) In-Reply-To: References: From: Sreeram Date: Wed, 7 Sep 2016 19:49:18 +0530 Message-ID: Subject: Re: Maximum limit on HBase cluster size To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a113e071c4b0677053beb992f archived-at: Wed, 07 Sep 2016 14:19:30 -0000 --001a113e071c4b0677053beb992f Content-Type: text/plain; charset=UTF-8 Hi Ted, From the link "Around 50-100 regions is a good number for a table with 1 or 2 column families. Remember that a region is a contiguous segment of a column family.". This number 50-100 regions per table at the level of individual region server or for the entire cluster ? Thanks, Sreeram On Wed, Sep 7, 2016 at 4:18 PM, Ted Yu wrote: > With properly designed schema, you don't need to split the cluster. > > Please see: > http://hbase.apache.org/book.html#schema > > > On Sep 7, 2016, at 1:59 AM, Sreeram wrote: > > > > Dear All, > > > > > > > > Looking forward to your views on the maximum limit of HBase cluster size. > > > > > > > > We are currently designing a HBase cluster and one of the tables > (designed > > in wide format) is expected to have roughly 6 billion rows in production > by > > 3 years (with an additional 200 million rows getting added each month). > In > > addition, we are expecting roughly 250 columns per row. Expected table > > data volume is around 250 TB (at end of 3 years, without considering HDFS > > replication) and growing by 7 TB per month. > > > > > > > > While we are provisioning the number of nodes based on expected data > > volume, wanted to check if there are any limits on the number of rows per > > cluster. > > > > > > > > Will it be advisable to split the cluster in such situation into two or > > more independent clusters? Will there be any impact to the read/write > > throughput/latency as the table grows over time? > > > > > > > > Please advise. > > > > > > > > Regards, > > > > Sreeram > --001a113e071c4b0677053beb992f--