accumulo-notifications mailing list archives

From "Bill Havanki (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-3141) Many RW failures due to balance check
Date Thu, 18 Sep 2014 15:34:33 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14139067#comment-14139067 ]

Bill Havanki commented on ACCUMULO-3141:
----------------------------------------

Thanks Josh!

FWIW, I didn't have any particular attachment to the idea of "twice the standard deviation"
beyond it sounding pretty reasonable and handling the balance problems seen at the time. So
this particular failure could either be a real imbalance or just a case that doesn't fit that
earlier expectation.
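
For anyone following along, here is a rough sketch of what a "twice the standard deviation"
check could look like. It is not the code in CheckBalance.java; the class name, method names,
and sample tablet counts are all made up for illustration, and it assumes a population
standard deviation over per-server tablet counts.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch only: not the actual CheckBalance implementation.
class BalanceSketch {

  /** Arithmetic mean of the per-server tablet counts. */
  static double mean(Map<String,Integer> counts) {
    double sum = 0;
    for (int c : counts.values())
      sum += c;
    return sum / counts.size();
  }

  /** Population standard deviation of the per-server tablet counts (an assumption). */
  static double stdDev(Map<String,Integer> counts) {
    double avg = mean(counts);
    double squares = 0;
    for (int c : counts.values())
      squares += (c - avg) * (c - avg);
    return Math.sqrt(squares / counts.size());
  }

  /** Servers whose count differs from the mean by more than k standard deviations. */
  static List<String> outliers(Map<String,Integer> counts, double k) {
    double avg = mean(counts);
    double limit = k * stdDev(counts);
    List<String> result = new ArrayList<>();
    for (Map.Entry<String,Integer> e : counts.entrySet()) {
      if (Math.abs(e.getValue() - avg) > limit)
        result.add(e.getKey());
    }
    return result;
  }

  public static void main(String[] args) {
    // Invented sample data: nine servers with 100 tablets, one with 345.
    Map<String,Integer> counts = new LinkedHashMap<>();
    for (int i = 0; i < 9; i++)
      counts.put("tserver-" + i, 100);
    counts.put("tserver-9", 345);

    // Mean is 124.5 and the standard deviation is 73.5, so the limit is 147.
    System.out.println(outliers(counts, 2.0)); // prints [tserver-9]
  }
}
```

One property of a threshold like this: with very few servers, a single overloaded server
inflates the standard deviation enough that it may never exceed the limit, so the multiplier
is a judgment call rather than a hard rule.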

> Many RW failures due to balance check
> -------------------------------------
>
>                 Key: ACCUMULO-3141
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3141
>             Project: Accumulo
>          Issue Type: Bug
>         Environment: 1.5.2 RC1, Hadoop 2.3.0, Zookeeper 3.4.5, CentOS 6, 20 node EC2
>            Reporter: Keith Turner
>             Fix For: 1.5.3, 1.6.1, 1.7.0
>
>
> While running RW test against 1.5.2 RC1, 10 of 17 walkers failed with a message like
> the following.
> {noformat}
> 16 19:35:48,820 [randomwalk.Framework] ERROR: Error during random walk
> java.lang.Exception: Error running node Concurrent.xml
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:285)
>         at org.apache.accumulo.test.randomwalk.Framework.run(Framework.java:63)
>         at org.apache.accumulo.test.randomwalk.Framework.main(Framework.java:122)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.accumulo.start.Main$1.run(Main.java:107)
>         at java.lang.Thread.run(Thread.java:744)
> Caused by: java.lang.Exception: Error running node ct.CheckBalance
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:285)
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:254)
>         ... 8 more
> Caused by: java.lang.Exception: servers are unbalanced! location 2487f8db354002f count 345 too far from average 151.86666666666667
>         at org.apache.accumulo.test.randomwalk.concurrent.CheckBalance.visit(CheckBalance.java:86)
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:254)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
