accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Newton (JIRA)" <>
Subject [jira] [Created] (ACCUMULO-2535) large cluster failed to shutdown cleanlin
Date Mon, 24 Mar 2014 14:09:43 GMT
Eric Newton created ACCUMULO-2535:

             Summary: large cluster failed to shutdown cleanlin
                 Key: ACCUMULO-2535
             Project: Accumulo
          Issue Type: Bug
          Components: tserver
    Affects Versions: 1.4.4
         Environment: large cluster
            Reporter: Eric Newton

A large 1.4.4 cluster failed to shutdown cleanly.  It was stuck trying to unload 7 non-root
!METADATA tablets.  Master was sending "unload tablet" messages constantly, but the tserver
refused to unload the tablets.

Examining the tserver logs of one server, it began a full major compaction, then received
the unload message, and finished the compaction.  The tablet was never unloaded.

The cluster had ~400 metadata tablets overall.

System recovered after a hard shutdown of the remaining 8 servers (7 metadata table servers
and 1 root tablet server).

This message was sent by Atlassian JIRA

View raw message