hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Boris Shkolnik (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-4045) Increment checkpoint if we see failures in rollEdits
Date Thu, 26 Mar 2009 23:18:50 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12689724#action_12689724
] 

Boris Shkolnik commented on HADOOP-4045:
----------------------------------------

Manual testing done:
1. Mount two directories (one for Edits and Image, one for Edits only).
2. create some files
3. unmount one of them and wait for checkpoint (or create a file) , verify that failed dir
is removed
4. unmount another one (optional) - more verifications
5. mount one back - (checkpoint or new files), verify that checkpointtime is updated and files
have the same size and MD5
6. mount the other one (optional) - more verifications
7. repeat 3 and 5
8. check WebUI all the time.

> Increment checkpoint if we see failures in rollEdits
> ----------------------------------------------------
>
>                 Key: HADOOP-4045
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4045
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>            Reporter: Lohit Vijayarenu
>            Assignee: Boris Shkolnik
>            Priority: Critical
>             Fix For: 0.19.2
>
>         Attachments: HADOOP-4045-1.patch, HADOOP-4045.patch
>
>
> In _FSEditLog::rollEdits_, if we encounter an error during opening edits.new, we remove
 the store directory associated with it. At this point we should also increment checkpoint
on all other directories.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message