zookeeper-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Brian Nixon (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ZOOKEEPER-3082) Fix server snapshot behavior when out of disk space
Date Thu, 05 Jul 2018 18:54:00 GMT
Brian Nixon created ZOOKEEPER-3082:

             Summary: Fix server snapshot behavior when out of disk space
                 Key: ZOOKEEPER-3082
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3082
             Project: ZooKeeper
          Issue Type: Bug
          Components: server
    Affects Versions: 3.4.12, 3.6.0, 3.5.5
            Reporter: Brian Nixon

When the ZK server tries to make a snapshot and the machine is out of disk space, the snapshot
creation fails and throws an IOException. An empty snapshot file is created, (probably because
the server is able to create an entry in the dir) but is not able to write to the file.
If snapshot creation fails, the server commits suicide. When it restarts, it will do so from
the last known good snapshot. However, when it tries to make a snapshot again, the same thing
happens. This results in lots of empty snapshot files being created. If eventually the DataDirCleanupManager
garbage collects the good snapshot files then only the empty files remain. At this point,
the server is well and truly screwed.

This message was sent by Atlassian JIRA

View raw message