Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E82FC1774B for ; Mon, 23 Feb 2015 20:35:14 +0000 (UTC) Received: (qmail 20111 invoked by uid 500); 23 Feb 2015 20:35:13 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 20035 invoked by uid 500); 23 Feb 2015 20:35:12 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 20023 invoked by uid 99); 23 Feb 2015 20:35:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 Feb 2015 20:35:12 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ndimiduk@gmail.com designates 74.125.82.43 as permitted sender) Received: from [74.125.82.43] (HELO mail-wg0-f43.google.com) (74.125.82.43) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 Feb 2015 20:35:07 +0000 Received: by wghl18 with SMTP id l18so1179222wgh.7 for ; Mon, 23 Feb 2015 12:34:47 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=dvNcevKHiQyhS80eO/D5ac8vTBx/1hrXet7riXRA+tI=; b=UTOhuBLZWZ70Sakj/hf8MGj4O5ZPMsrDyZDoOdzqS0EhU8qixHNg8V/f8qsDf72YUw 1G+gW7ai5d01vcNNyvHcbmaqjrYKaFzpl2J8zA0AUU2HWZ37O//L0W6Eqyl2JHHHWsLe OpKVkX3PP3AHmc4lgiQNRmp+5TUG8fQzFs7MNyz179T8/gws2oWIJTiC48C4vT58XZjv xlQEb0/+qbhwfUh31Ro5S1YU4S0vY6wLgQXpQXVgzA8P5DDcAWjOcIKvcTsrjn52955O JJ/omp0TNr3InS2luzsNV3/wg46XQWoDNt2k3kM51SySiOsMetAUIel/m37RCJyEzSQU zLBQ== X-Received: by 10.180.20.167 with SMTP id o7mr23585630wie.50.1424723686198; Mon, 23 Feb 2015 12:34:46 -0800 (PST) MIME-Version: 1.0 Received: by 10.28.186.135 with HTTP; Mon, 23 Feb 2015 12:34:25 -0800 (PST) In-Reply-To: <0D4EC5F0-CF9C-4A7D-9B4C-A3378BDE8548@segel.com> References: <0D4EC5F0-CF9C-4A7D-9B4C-A3378BDE8548@segel.com> From: Nick Dimiduk Date: Mon, 23 Feb 2015 12:34:25 -0800 Message-ID: Subject: Re: HBase Region always in transition + corrupt HDFS To: hbase-user Content-Type: multipart/alternative; boundary=bcaec53f3583381214050fc75577 X-Virus-Checked: Checked by ClamAV on apache.org --bcaec53f3583381214050fc75577 Content-Type: text/plain; charset=UTF-8 HBase/HDFS are maintaining block checksums, so presumably a corrupted block would fail checksum validation. Increasing the number of replicas increases the odds that you'll still have a valid block. I'm not an HDFS expert, but I would be very surprised if HDFS is validating a "questionable block" via byte-wise comparison over the network amongst the replica peers. On Mon, Feb 23, 2015 at 12:25 PM, Michael Segel wrote: > > On Feb 23, 2015, at 1:47 AM, Arinto Murdopo wrote: > > We're running HBase (0.94.15-cdh4.6.0) on top of HDFS (Hadoop > 2.0.0-cdh4.6.0). > For all of our tables, we set the replication factor to 1 (dfs.replication > = 1 in hbase-site.xml). We set to 1 because we want to minimize the HDFS > usage (now we realize we should set this value to at least 2, because > "failure is a norm" in distributed systems). > > > > Sorry, but you really want this to be a replication value of at least 3 > and not 2. > > Suppose you have corruption but not a lost block. Which copy of the two is > right? > With 3, you can compare the three and hopefully 2 of the 3 will match. > > --bcaec53f3583381214050fc75577--