Return-Path: X-Original-To: apmail-accumulo-notifications-archive@minotaur.apache.org Delivered-To: apmail-accumulo-notifications-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id EAF151013B for ; Tue, 18 Mar 2014 20:34:03 +0000 (UTC) Received: (qmail 1736 invoked by uid 500); 18 Mar 2014 20:33:57 -0000 Delivered-To: apmail-accumulo-notifications-archive@accumulo.apache.org Received: (qmail 1471 invoked by uid 500); 18 Mar 2014 20:33:52 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 1212 invoked by uid 99); 18 Mar 2014 20:33:48 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Mar 2014 20:33:48 +0000 Date: Tue, 18 Mar 2014 20:33:48 +0000 (UTC) From: "ASF subversion and git services (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (ACCUMULO-2488) Concurrent randomwalk balance check needs refinement MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/ACCUMULO-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939733#comment-13939733 ] ASF subversion and git services commented on ACCUMULO-2488: ----------------------------------------------------------- Commit a4174248a96cadcc79a9de4015c90c6618a96418 in accumulo's branch refs/heads/1.5.2-SNAPSHOT from [~bhavanki] [ https://git-wip-us.apache.org/repos/asf?p=accumulo.git;h=a417424 ] ACCUMULO-2488 Change criteria for unbalanced servers in concurrent randomwalk The Concurrent randomwalk test used to consider servers unbalanced if any server's tablet count differed from the cluster average by more than a fifth of the average or by one, whichever was larger. This would cause failures under typical balancings from the default balancer. This commit changes the criterion for an unbalanced server to be double the standard deviation from the cluster average. > Concurrent randomwalk balance check needs refinement > ---------------------------------------------------- > > Key: ACCUMULO-2488 > URL: https://issues.apache.org/jira/browse/ACCUMULO-2488 > Project: Accumulo > Issue Type: Test > Components: test > Affects Versions: 1.4.4 > Reporter: Bill Havanki > Assignee: Bill Havanki > Priority: Minor > Labels: randomwalk, test > Fix For: 1.4.5, 1.5.2, 1.6.0 > > > The check for balanced tablets in the randomwalk Concurrent test too easily fails. > Here is a real-life example from the test for the number of tablets across five tablet servers: 2, 5, 2, 2, 3. (An old unrelated table plays into these totals.) This produces a mean of 2.8. The cluster is considered unbalanced by the test when any server's count differs from the mean by the larger of 1 or the mean divided by 5. In this case, 2.8/5 is less than 1, so the second tablet server fails since it has more than 3.8 tablets. Even a 4 would fail. > Part of the problem in this particular case is that there are so few tablets, and so few tablet servers. The cluster also seems happy to leave these counts as is, as I continue to check it, so the test's definition of unbalanced is too narrow. > The test needs to be refined to detect unbalanced conditions with a statistically decent calculation. -- This message was sent by Atlassian JIRA (v6.2#6252)