From: lars hofhansl
To: Demai Ni, "user@hbase.apache.org"
Date: Thu, 20 Mar 2014 23:20:10 -0700 (PDT)
Subject: Re: How much time HBase will take if one Region Server down.

Depends. If a machine fails hard and HDFS 2.x is set up correctly, I'd expect this to be the ZK timeout (180s by default in 0.94, but it can be lowered) plus a few minutes to rejigger things.

-- Lars


________________________________
From: Demai Ni
To: "user@hbase.apache.org"; lars hofhansl
Sent: Wednesday, March 19, 2014 9:16 AM
Subject: Re: How much time HBase will take if one Region Server down.

Lars,

I didn't get the 15~20 min from the article. I played a bit with my clusters on 94 early last year, and got a ball-park number. My cluster was using the default HBase configuration at the time, so it definitely wasn't well tuned. For HBase 94, what's a reasonable MTTR number in your experience? Thanks

Demai


On Tue, Mar 18, 2014 at 10:51 PM, lars hofhansl wrote:

>Many of the improvements were made in HDFS, so they benefit 0.94 equally.
>Not sure where you read 15-20 minutes for 0.94 in that article.
>
>-- Lars
>
>
>________________________________
>From: Demai Ni
>To: "user@hbase.apache.org"
>Sent: Tuesday, March 18, 2014 10:59 AM
>Subject: Re: How much time HBase will take if one Region Server down.
>
>A good article to start with:
>http://hortonworks.com/blog/introduction-to-hbase-mean-time-to-recover-mttr/
>
>I would say 15~20 min for 94 or earlier; it can be reduced to a couple of min
>with HBase 96+ and Hadoop .
>
>Demai
>
>
>On Tue, Mar 18, 2014 at 10:55 AM, Upendra Yadav wrote:
>
>> How much time will HBase take if one Region Server goes down and a client
>> tries to read a region that belongs to that Region Server?
>>
>> That is, are all regions that belong to the dead RS unavailable?
>>
>> How much time will HMaster take to assign those regions to a live RS? What
>> are the dependencies that may increase region assignment time in such a
>> situation?
>>
>> Thanks....
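
For anyone who wants to put numbers on the estimate in the reply at the top of this thread (ZK timeout plus a few minutes), here is a rough sketch. It assumes the timeout in question is the zookeeper.session.timeout property in hbase-site.xml, specified in milliseconds. The 3-minute recovery figure and the 60-second lowered timeout are hypothetical placeholders, not values from the thread, and the function names are made up for illustration.

```python
# Rough sketch of the estimate in the reply: failure detection (ZK session
# timeout) plus recovery work. The recovery figure is an assumed placeholder.

ZK_SESSION_TIMEOUT_MS = 180_000  # default in HBase 0.94, per the reply


def estimate_unavailability_s(zk_timeout_ms: int, recovery_min: float) -> float:
    """Seconds the regions of a hard-failed region server may stay unreachable."""
    return zk_timeout_ms / 1000 + recovery_min * 60


def hbase_site_property(name: str, value: int) -> str:
    """Render one <property> element for hbase-site.xml."""
    return (
        "<property>\n"
        f"  <name>{name}</name>\n"
        f"  <value>{value}</value>\n"
        "</property>"
    )


if __name__ == "__main__":
    assumed_recovery_min = 3  # hypothetical stand-in for "a few minutes"
    for timeout_ms in (ZK_SESSION_TIMEOUT_MS, 60_000):
        total = estimate_unavailability_s(timeout_ms, assumed_recovery_min)
        print(f"timeout={timeout_ms} ms -> roughly {total / 60:.1f} min")
    # Fragment one might add to hbase-site.xml to lower the timeout.
    print(hbase_site_property("zookeeper.session.timeout", 60_000))
```

A lower timeout shortens detection but makes it more likely that a long GC pause gets a healthy region server declared dead, so the value is a trade-off. The ZooKeeper server's own session limits may also bound what the client can request.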