accumulo-notifications mailing list archives

From "Eric Newton (JIRA)" <>
Subject [jira] [Created] (ACCUMULO-1572) single node zookeeper failure kills connected accumulo servers
Date Tue, 16 Jul 2013 13:38:49 GMT
Eric Newton created ACCUMULO-1572:

             Summary: single node zookeeper failure kills connected accumulo servers
                 Key: ACCUMULO-1572
             Project: Accumulo
          Issue Type: Bug
          Components: master, tserver
    Affects Versions: 1.5.0
            Reporter: Eric Newton
            Priority: Blocker
             Fix For: 1.5.1

Drew Thornton writes on the user mailing list:
If one ZooKeeper node is shut down or fails while the rest of the ensemble stays up, the
tablet servers attached as clients to that node immediately fail. If one of those clients
happens to be the master, the cluster goes down.

Accumulo does not seem to fail over to the remaining ZooKeeper nodes, so I have to
restart the individual tablet servers.

The ZooKeeper ensemble is very stable and has plenty of bandwidth, memory, and processing
capacity, so taking one node down out of five doesn't crash the ZooKeeper servers, just the
tablet servers.
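
For illustration only, the failover behavior the report expects can be sketched as a toy simulation: a client holding the full ensemble list moves to the next reachable server when the one it is attached to goes away. This is not ZooKeeper's or Accumulo's client code; `FailoverClient`, the `down` set, and the host names are hypothetical and exist only for this sketch.

```python
from dataclasses import dataclass, field


class NoServerAvailable(Exception):
    """Raised when every server in the ensemble is unreachable."""


@dataclass
class FailoverClient:
    # Hypothetical stand-in for a client session; not a real ZooKeeper API.
    servers: list                            # ensemble members, e.g. "zk1:2181"
    down: set = field(default_factory=set)   # simulated unreachable servers
    current: int = 0                         # index of the attached server

    def ensure_connected(self):
        """Return a reachable server, rotating through the list if needed."""
        for offset in range(len(self.servers)):
            idx = (self.current + offset) % len(self.servers)
            if self.servers[idx] not in self.down:
                self.current = idx
                return self.servers[idx]
        raise NoServerAvailable("no reachable server in ensemble")


# Five-node ensemble, as in the report (host names are made up).
ensemble = [f"zk{i}:2181" for i in range(1, 6)]
client = FailoverClient(ensemble)

client.down.add("zk1:2181")        # the node the client was attached to fails
print(client.ensure_connected())   # prints "zk2:2181"
```

In this sketch the client only gives up when every server in the list is unreachable, which is the behavior the reporter expected from the tablet servers and the master.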

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators