hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tsz Wo (Nicholas), SZE (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-3075) Backport HADOOP-4885 to branch-1
Date Wed, 14 Mar 2012 18:32:40 GMT

     [ https://issues.apache.org/jira/browse/HDFS-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Tsz Wo (Nicholas), SZE updated HDFS-3075:
-----------------------------------------

    Description: 
When a storage directory is inaccessible, namenode removes it from the valid storage dir list
to a removedStorageDirs list. Those storage directories will not be restored when they become
healthy again. 

The proposed solution is to restore the previous failed directories at the beginning of checkpointing,
say, rollEdits, by copying necessary metadata files from healthy directory to unhealthy ones.
In this way, whenever a failed storage directory is recovered by the administrator, he/she
can immediately force a checkpointing to restored a failed directory.

See also HADOOP-4885.

  was:
When a storage directory is inaccessible, namenode removes it from the valid storage dir list
to a removedStorageDirs list. Those storage directories will not be restored when they become
healthy again. 

The proposed solution is to restore the previous failed directories at the beginning of checkpointing,
say, rollEdits, by copying necessary metadata files from healthy directory to unhealthy ones.
In this way, whenever a failed storage directory is recovered by the administrator, he/she
can immediately force a checkpointing to restored a failed directory.

        Summary: Backport HADOOP-4885 to branch-1  (was: Add mechanism to restore the removed
storage directories)
    
> Backport HADOOP-4885 to branch-1
> --------------------------------
>
>                 Key: HDFS-3075
>                 URL: https://issues.apache.org/jira/browse/HDFS-3075
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: name-node
>    Affects Versions: 0.24.0, 1.1.0
>            Reporter: Brandon Li
>            Assignee: Brandon Li
>
> When a storage directory is inaccessible, namenode removes it from the valid storage
dir list to a removedStorageDirs list. Those storage directories will not be restored when
they become healthy again. 
> The proposed solution is to restore the previous failed directories at the beginning
of checkpointing, say, rollEdits, by copying necessary metadata files from healthy directory
to unhealthy ones. In this way, whenever a failed storage directory is recovered by the administrator,
he/she can immediately force a checkpointing to restored a failed directory.
> See also HADOOP-4885.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message