hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Todd Lipcon (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-4288) NN accepts incremental BR as IBR in safemode
Date Wed, 16 Jan 2013 04:22:15 GMT

    [ https://issues.apache.org/jira/browse/HDFS-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554727#comment-13554727
] 

Todd Lipcon commented on HDFS-4288:
-----------------------------------

This now seems to be failing on precommit builds even though it passed its own precommit build
above:
java.lang.AssertionError: Number of blocks is inconcistent with the array length
	at org.apache.hadoop.hdfs.protocol.BlockListAsLongs.getNumberOfBlocks(BlockListAsLongs.java:204)
	at org.apache.hadoop.hdfs.protocol.BlockListAsLongs$BlockReportIterator.hasNext(BlockListAsLongs.java:153)
	at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.processFirstBlockReport(BlockManager.java:1672)
	at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.processReport(BlockManager.java:1558)
	at org.apache.hadoop.hdfs.server.blockmanagement.TestBlockManager.testSafeModeIBR(TestBlockManager.java:511)

I'm able to reproduce locally as well. Not sure what changed on trunk between the precommit
run last week and now, but seems to be causing issues. I'll revert this for now.
                
> NN accepts incremental BR as IBR in safemode
> --------------------------------------------
>
>                 Key: HDFS-4288
>                 URL: https://issues.apache.org/jira/browse/HDFS-4288
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 0.23.0, 2.0.0-alpha, 3.0.0
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>             Fix For: 3.0.0, 2.0.3-alpha
>
>         Attachments: HDFS-4288.branch-23.patch, HDFS-4288.patch
>
>
> If a DN is ready to send an incremental BR and the NN goes down, the DN will repeatedly
try to reconnect.  The NN will then process the DN's incremental BR as an initial BR.  The
NN now thinks the DN has only a few blocks, and will ignore all subsequent BRs from that DN
until out of safemode -- which it may never do because of all the "missing" blocks on the
affected DNs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message