Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C82C2DD18 for ; Tue, 30 Oct 2012 19:15:59 +0000 (UTC) Received: (qmail 40970 invoked by uid 500); 30 Oct 2012 19:15:57 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 40917 invoked by uid 500); 30 Oct 2012 19:15:57 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 40909 invoked by uid 99); 30 Oct 2012 19:15:57 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 30 Oct 2012 19:15:57 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of kevin.odell@cloudera.com designates 209.85.220.169 as permitted sender) Received: from [209.85.220.169] (HELO mail-vc0-f169.google.com) (209.85.220.169) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 30 Oct 2012 19:15:53 +0000 Received: by mail-vc0-f169.google.com with SMTP id fl17so866430vcb.14 for ; Tue, 30 Oct 2012 12:15:32 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=h4rDCpp9A+r3QzNUxYhNlxre75h7hieWFHEFl5wSRj8=; b=OPkql8BG2PFSOV69f0bSZ5vAun0leHaotO/ou0muSjqR2KYxbRe9/VptaMImMMT5xl evYEY+WAEscARxcp8sc68XbSFQrvZus8tcSfaq6ICz8PhNJv7SyUPS6L+kvOoPMGfPJ1 6kujpanpoSfB7+oCMgKF5AjOiw18xOJ+xRYuddQISu/ixmZ95OiDKNNw7B7LRSAy+Dak KzJOvmJpcQglC8N8e8ETMUxlnihi89W3KgxsCzxEFsAcVz/Svwhgf1h2HSvK87mGd/VE yh8X9qSCVQFrhdoe8lPkedRjLqx0oGL7lY23HgaIE36gQNu54dPGIfY7d3WdYmShyhyJ NO+Q== MIME-Version: 1.0 Received: by 10.52.95.34 with SMTP id dh2mr44215425vdb.69.1351624532021; Tue, 30 Oct 2012 12:15:32 -0700 (PDT) Received: by 10.58.243.71 with HTTP; Tue, 30 Oct 2012 12:15:32 -0700 (PDT) In-Reply-To: References: Date: Tue, 30 Oct 2012 14:15:32 -0500 Message-ID: Subject: Re: Hbase cluster for serving real time site traffic From: "Kevin O'dell" To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=20cf307cfef419d1c504cd4b9c8a X-Gm-Message-State: ALoCoQkdzOlgK6Dhd+NxtNVF/kUPgBEY/+2ZWmg2sTsoQa6SZ3iXsG4gXEZFod5EZRoyElFroBIr X-Virus-Checked: Checked by ClamAV on apache.org --20cf307cfef419d1c504cd4b9c8a Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Varun, I will take a shot at answering this: 1) It seems hbase starts only one zookeeper on the master node - which is critical for operation - how many zookeepers should I use and can I run those on the region servers ? <-- 3 and they should be on dedicated servers for a real production environment. 2) How many masters to use - does hbase support multiple masters (primary and secondary) within the same cluster ? From my understanding, master availability is not critical for operation. <--2 if you lose the master you lose HBase. The Master is VERY critical. 3) NameNode - We are running hadoop 0.8 - I have read that NameNode is a single point of failure and we should really be running two name node(s) so we can failover. Is it fine to run these on the region servers ? 2, you will want to use HA for a real production workload. The SNN(Secondary Name Node) is a very misleading name. So, yes, secondary NameNode is probably more critical than the secondary master - since the master is only responsible for metadata changes/region splits/table creation etc and not for writes/reads. <--- This is not correct. The Secondary Name Node is not a failover node. You will want to use a release that has HA to guarantee availability at the NN level. The master is in charge of META data operations, but also with out the Master the RS will not continue to just work. It is very important to have two masters. I will defer Jean-Marc on the Schema designs. On Tue, Oct 30, 2012 at 1:03 PM, Varun Sharma wrote: > Thanks for the tips. > > So, yes, secondary NameNode is probably more critical than the secondary > master - since the master is only responsible for metadata changes/region > splits/table creation etc and not for writes/reads. > > Regarding the keys question - i meant that the (row + column) length is > 24-32 bytes and the value length is 0-1 bytes. Currently, we have a clust= er > running with all the data loaded into hbase but it all runs with default > settings. > > Thanks > Varun > > On Tue, Oct 30, 2012 at 10:53 AM, Jean-Marc Spaggiari < > jean-marc@spaggiari.org> wrote: > > > My 2=A2. > > > > 1) You need an odd number of ZooKeeper nodes. So 3 is the minimum > > recommanded for production. > > 2) Yes, you have Master and SecondaryMaster. And it's also recommanded > > to have one of each. And the master is critical. If you are loosing > > it, you are loosing your cluster. > > 3) NameNode is hadoop, not hbase. You should follow hadoop > > recommandations. Like you have secondarymaster, you have > > secondarynamenode. So I think you should have as many > > secondarynamenode as you have secondarymaster (on the same machine?). > > 4) I'm not sure to understanding this question. Key are binary. Array > > of bytes. So 32 0-1 bytes is a 3 bytes long array. It's not a lot. > > This will only give you 2^32 different rows. You will have to > > pre-split them, or you will end with almost all of them on the same > > regionserver? > > > > JM > > > > 2012/10/30, Varun Sharma : > > > Hi, > > > > > > We are planning to experiment with a cluster for serving production > > traffic > > > using hbase for pinterest. We are starting off with a 10 region serve= r > + > > 1 > > > master cluster on Amazon EMR version 0.92. I had some very naive > > questions > > > (primarily around points of failure): > > > > > > 1) It seems hbase starts only one zookeeper on the master node - whic= h > is > > > critical for operation - how many zookeepers should I use and can I r= un > > > those on the region servers ? > > > 2) How many masters to use - does hbase support multiple masters > (primary > > > and secondary) within the same cluster ? From my understanding, maste= r > > > availability is not critical for operation. > > > 3) NameNode - We are running hadoop 0.8 - I have read that NameNode i= s > a > > > single point of failure and we should really be running two name > node(s) > > so > > > we can failover. Is it fine to run these on the region servers ? > > > 4) Our current application involves long row/column - 24-32 bytes wit= h > > 0-1 > > > bytes of values. Should we be using a different key encoding than the > > > default encoding ? What advantages could it buy us ? > > > > > > We are currently using amazon EMR for testing purposes which runs hba= se > > > 0.92. If it works well, we would like to configure our own cluster wi= th > > > probably the latest version of hbase which appears to be 0.94 at the > > > moment. > > > > > > Thanks > > > Varun > > > > > > --=20 Kevin O'Dell Customer Operations Engineer, Cloudera --20cf307cfef419d1c504cd4b9c8a--