Date: Thu, 10 May 2012 11:41:17 -0700
Subject: Re: Occasional regionserver crashes following socket errors writing to HDFS
From: Dave Revell
To: user@hbase.apache.org

Some examples of when you'd want a reducer:
http://static.usenix.org/event/osdi04/tech/full_papers/dean/dean.pdf

On Thu, May 10, 2012 at 11:30 AM, Michael Segel wrote:

> Dave, do you really want to go there?
>
> OP has a couple of issues and he was going down a rabbit hole.
> (You can choose if that's a reference to 'the Matrix, Jefferson Starship,
> Alice in Wonderland... or all of the above)
>
> So to put him on the correct path, I recommended the following, not in any
> order...
>
> 1) Increase his region size for this table only.
> 2) Look to decreasing the number of regions managed by a RS (which is why
> you increase region size)
> 3) Up the dfs.balance.bandwidthPerSec. (How often does HBase move regions
> and how exactly do they move regions?)
> 4) Look at implementing MSLABS and GC tuning. This cuts down on the
> overhead.
> 5) Refactoring his job....
>
> Oops.
> Ok I didn't put that in the list.
> But that was the last thing I wrote as a separate statement.
> Clearly you didn't take my advice and think about the problem....
>
> To prove a point.... you wrote:
> 'Many mapreduce algorithms require a reduce phase (e.g. sorting)'
>
> Ok. So tell me why you would want to sort your input in to HBase and if
> that's really a good thing?
> Oops!... :-)
>
> On May 10, 2012, at 12:31 PM, Dave Revell wrote:
>
> > This "you don't need a reducer" conversation is distracting from the real
> > problem and is false.
> >
> > Many mapreduce algorithms require a reduce phase (e.g. sorting). The fact
> > that the output is written to HBase or somewhere else is irrelevant.
> >
> > -Dave
> >
> > On Thu, May 10, 2012 at 6:26 AM, Michael Segel
> > <michael_segel@hotmail.com> wrote:
> > [SNIP]