hadoop-hdfs-issues mailing list archives

From "Hadoop QA (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-2815) Namenode is not coming out of safemode when we perform ( NN crash + restart ) . Also FSCK report shows blocks missed.
Date Tue, 31 Jan 2012 18:38:10 GMT

    [ https://issues.apache.org/jira/browse/HDFS-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13197093#comment-13197093 ]

Hadoop QA commented on HDFS-2815:

-1 overall.  Here are the results of testing the latest attachment 
  against trunk revision .

    +1 @author.  The patch does not contain any @author tags.

    -1 tests included.  The patch doesn't appear to include any new or modified tests.
                        Please justify why no new tests are needed for this patch.
                        Also please list what manual steps were performed to verify this patch.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 eclipse:eclipse.  The patch built with eclipse:eclipse.

    -1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) warnings.

    +1 release audit.  The applied patch does not increase the total number of release audit warnings.

    -1 core tests.  The patch failed these unit tests:

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/1827//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/1827//artifact/trunk/hadoop-hdfs-project/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html
Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/1827//console

This message is automatically generated.
> Namenode is not coming out of safemode when we perform ( NN crash + restart ) .  Also FSCK report shows blocks missed.
> ----------------------------------------------------------------------------------------------------------------------
>                 Key: HDFS-2815
>                 URL: https://issues.apache.org/jira/browse/HDFS-2815
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 0.22.0, 0.24.0, 0.23.1, 1.0.0
>            Reporter: Uma Maheswara Rao G
>            Assignee: Uma Maheswara Rao G
>            Priority: Critical
>         Attachments: HDFS-2815.patch
> While testing HA (internal) with continuous switchovers at roughly 5-minute intervals, I found some *missing blocks*, and the namenode went into safemode after the next switch.
>    After analysis, I found that these files had already been deleted by clients, but I do not see any delete command entries in the namenode log files. The namenode had nevertheless added those blocks to invalidateSets, and the DNs deleted the blocks.
>    When the namenode restarted, it went into safemode and kept expecting some more blocks before it would come out of safemode.
>    The reason could be that the file is deleted in memory and its blocks are added to invalidates, and only after this does the namenode try to sync the edits to the editlog file. By that time the NN has already asked the DNs to delete those blocks. The namenode then shuts down before persisting to the editlog (the log is behind).
>    For this reason we may not get the INFO logs about the delete, and when we restart the namenode (in my scenario this is another switch), the namenode still expects these deleted blocks, because the delete request was never persisted to the editlog.
>    I reproduced this scenario with debug points. *I feel we should not add the blocks to invalidates before persisting to the editlog*.
>     Note: for the switch, we used kill -9 (force kill).
>   I am currently on version 0.20.2. The same was verified on 0.23 as well with a normal crash + restart.
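
The ordering problem described above can be sketched with a toy model. This is only an illustration under stated assumptions, not HDFS code and not the attached patch: ToyNameNode, missingAfterRestart, and the block id "blk_1" are hypothetical names, and datanodes are assumed to act on invalidates immediately. It compares adding a block to invalidates before the edit is synced against adding it only after the sync, with a kill -9 style crash in between.

```java
import java.util.HashSet;
import java.util.Set;

/** Toy model only; none of these names are real HDFS classes or methods. */
public class InvalidateOrderingSketch {

    /** Hypothetical stand-in for a namenode: in-memory state plus durable state. */
    static class ToyNameNode {
        final Set<String> inMemoryBlocks = new HashSet<>(); // blocks the running NN knows about
        final Set<String> durableBlocks = new HashSet<>();  // what image + synced edits describe
        final Set<String> invalidates = new HashSet<>();    // blocks the DNs are told to delete

        void addBlock(String id) {
            inMemoryBlocks.add(id);
            durableBlocks.add(id);
        }

        /** Persist the delete, so a restart no longer expects this block. */
        void syncDelete(String id) {
            durableBlocks.remove(id);
        }
    }

    /**
     * Deletes one block, optionally crashing before the edit is synced.
     * Returns how many blocks a restarted namenode expects but the datanode no longer holds.
     */
    public static int missingAfterRestart(boolean invalidateBeforeSync, boolean crashBeforeSync) {
        ToyNameNode nn = new ToyNameNode();
        Set<String> dataNodeBlocks = new HashSet<>();
        nn.addBlock("blk_1");
        dataNodeBlocks.add("blk_1");

        nn.inMemoryBlocks.remove("blk_1");            // delete applied in memory
        if (invalidateBeforeSync) {
            nn.invalidates.add("blk_1");              // ordering described in the report
            dataNodeBlocks.removeAll(nn.invalidates); // assumed: the DN deletes right away
        }
        if (!crashBeforeSync) {
            nn.syncDelete("blk_1");                   // edit reaches the durable log
            if (!invalidateBeforeSync) {
                nn.invalidates.add("blk_1");          // proposed ordering: invalidate after sync
                dataNodeBlocks.removeAll(nn.invalidates);
            }
        }

        // Restart: expectations are rebuilt from durable state only.
        Set<String> expectedButMissing = new HashSet<>(nn.durableBlocks);
        expectedButMissing.removeAll(dataNodeBlocks);
        return expectedButMissing.size();
    }

    public static void main(String[] args) {
        System.out.println("invalidate first, crash before sync: " + missingAfterRestart(true, true));
        System.out.println("sync first, crash before sync:       " + missingAfterRestart(false, true));
        System.out.println("invalidate first, no crash:          " + missingAfterRestart(true, false));
    }
}
```

In this toy model only the first case (invalidate first, then crash before the sync) leaves the restarted namenode expecting a block that no datanode can report, which matches the stuck-in-safemode symptom in the description. Deferring the invalidate until after the sync leaves at worst an undeleted block, which the delete can clean up once it is replayed or reissued.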

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

