Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E7D1410066 for ; Fri, 14 Feb 2014 23:53:00 +0000 (UTC) Received: (qmail 83083 invoked by uid 500); 14 Feb 2014 23:52:57 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 82965 invoked by uid 500); 14 Feb 2014 23:52:57 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 82957 invoked by uid 99); 14 Feb 2014 23:52:57 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Feb 2014 23:52:57 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of rohitkelkar@gmail.com designates 209.85.223.181 as permitted sender) Received: from [209.85.223.181] (HELO mail-ie0-f181.google.com) (209.85.223.181) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Feb 2014 23:52:52 +0000 Received: by mail-ie0-f181.google.com with SMTP id rl12so119953iec.40 for ; Fri, 14 Feb 2014 15:52:31 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=GQzYGmtd8p1E1/ZkAFhheNdM/WfCd9xkVQ+k1iy+l44=; b=X3NBtBfwCh/BsllXs/U3MzOO1tSBlxmmzWDOEP4GvWZul35pfrUwULwLLgJBusExsR L3eRbxm1NQKqDKqZl5p8QQTorYYmqwk4xX4BOvRc+2CnTK+qX/SOU2QG6GRaFUXwbjne tHVJV2vmDI8N34zRN8LC9LVBOEyVJh5MWjonVeeOcIbNGPvB1c6DtCF5eZCc6+JSAMKq y0BthboLfRpfx/z2us6Dj/FclGVfbmRljsPEmMYekFs0D+SNXaN5SRsMokE9rraZhBhJ uD/B5NI8K8YA8p09yfWOPS35bBu+770qk8TXtzGiFYR7W+EKiRrURbiLQfGNjNWhVTV2 ISvA== MIME-Version: 1.0 X-Received: by 10.50.43.138 with SMTP id w10mr5345360igl.33.1392421951664; Fri, 14 Feb 2014 15:52:31 -0800 (PST) Received: by 10.43.82.193 with HTTP; Fri, 14 Feb 2014 15:52:31 -0800 (PST) Date: Fri, 14 Feb 2014 17:52:31 -0600 Message-ID: Subject: uneven region distribution From: Rohit Kelkar To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=089e01176c7fce7a9904f2667fc2 X-Virus-Checked: Checked by ClamAV on apache.org --089e01176c7fce7a9904f2667fc2 Content-Type: text/plain; charset=ISO-8859-1 I am using hbase version 0.92.4 on a 5 node cluster. I am seeing that a particular region server often crashes. A status 'simple' on hbase shell gives the following stats HBase Shell; enter 'help' for list of supported commands. Type "exit" to leave the HBase Shell Version 0.94.2, r1395367, Sun Oct 7 19:11:01 UTC 2012 status 'simple' 4 live servers server7:60020 1392017875910 requestsPerSecond=0, numberOfOnlineRegions=419, usedHeapMB=3315, maxHeapMB=6127 server4:60020 1392300859332 requestsPerSecond=843, numberOfOnlineRegions=379, usedHeapMB=2070, maxHeapMB=6127 server3:60020 1391583646998 requestsPerSecond=429, numberOfOnlineRegions=653, usedHeapMB=3198, maxHeapMB=6127 server6:60020 1391583647588 requestsPerSecond=0, numberOfOnlineRegions=966, usedHeapMB=2975, maxHeapMB=6127 1 dead servers server5,60020,1392108515637 Aggregate load: 1272, regions: 2417 The dead region server has 2417 regions as opposed to 419, 379, 653, 966 regions on other servers. Am I right in attributing the region server crash to the disproportionately high number of regions on that server? If I invoke the balancer on hbase shell using the "balancer" command it returns true. But it does not change the status of the assignments. - R --089e01176c7fce7a9904f2667fc2--