zookeeper-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ZOOKEEPER-3082) Fix server snapshot behavior when out of disk space
Date Mon, 30 Jul 2018 06:43:00 GMT

    [ https://issues.apache.org/jira/browse/ZOOKEEPER-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16561475#comment-16561475
] 

Hudson commented on ZOOKEEPER-3082:
-----------------------------------

SUCCESS: Integrated in Jenkins build ZooKeeper-trunk #128 (See [https://builds.apache.org/job/ZooKeeper-trunk/128/])
ZOOKEEPER-3082: Fix server snapshot behavior when out of disk space (hanm: rev 78e4a1047c701006dd4ec8d09065eda0e7adedb5)
* (edit) src/java/main/org/apache/zookeeper/server/persistence/FileTxnSnapLog.java
* (add) src/java/test/org/apache/zookeeper/server/persistence/EmptySnapshotTest.java


> Fix server snapshot behavior when out of disk space
> ---------------------------------------------------
>
>                 Key: ZOOKEEPER-3082
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3082
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 3.6.0, 3.4.12, 3.5.5
>            Reporter: Brian Nixon
>            Assignee: Brian Nixon
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 3.6.0
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> When the ZK server tries to make a snapshot and the machine is out of disk space, the
> snapshot creation fails and throws an IOException. An empty snapshot file is still created,
> probably because the server is able to create an entry in the directory but is not able to
> write to the file.
>  
> If snapshot creation fails, the server shuts itself down. When it restarts, it will do
> so from the last known good snapshot. However, when it tries to make a snapshot again, the
> same thing happens. This results in many empty snapshot files being created. If the
> DataDirCleanupManager eventually garbage collects the good snapshot files, then only the
> empty files remain. At this point, the server has no valid snapshot left to restore from.
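
The failure mode described above can be illustrated with a small sketch. This is not the patch committed for this issue, only one possible way to avoid leaving zero-length snapshot files behind: if the write fails and the file is still empty, delete it before rethrowing. The class name SnapshotCleanupSketch, the SnapshotWriter interface, and the save method are hypothetical names chosen for illustration and are not part of ZooKeeper.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical illustration only; these names do not exist in ZooKeeper.
public class SnapshotCleanupSketch {

    /** Hypothetical callback that serializes server state to the stream. */
    @FunctionalInterface
    public interface SnapshotWriter {
        void writeTo(OutputStream out) throws IOException;
    }

    /**
     * Writes a snapshot to snapshotFile. If the write fails and the file
     * was left empty (for example, because the disk is full), the empty
     * file is deleted before the original exception is rethrown.
     */
    public static void save(Path snapshotFile, SnapshotWriter writer) throws IOException {
        try (OutputStream out = Files.newOutputStream(snapshotFile)) {
            writer.writeTo(out);
        } catch (IOException e) {
            // try-with-resources has already closed the stream at this point.
            if (Files.exists(snapshotFile) && Files.size(snapshotFile) == 0) {
                // Remove the zero-length file so it is never mistaken
                // for a usable snapshot on a later restart.
                Files.delete(snapshotFile);
            }
            throw e;
        }
    }
}
```

With a guard like this, repeated restarts on a full disk would not accumulate empty snapshot files. A partially written, non-empty file is left in place by this sketch and would need separate validation.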



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
