hadoop-hdfs-issues mailing list archives

From "Kihwal Lee (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-4138) TestBackupNode.testCheckpointNode fails due to a backup node startup issue.
Date Thu, 01 Nov 2012 21:38:12 GMT

    [ https://issues.apache.org/jira/browse/HDFS-4138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13489054#comment-13489054 ]

Kihwal Lee commented on HDFS-4138:
----------------------------------

When FSNamesystem#startCommonServices() is called from initialize(), the following error is
logged while DefaultMetricsSystem.instance().register() is being invoked. A MiniDFSCluster was
already up and ready before the backup node was started.

{panel}
2012-11-01 12:59:43,397 ERROR lib.MethodMetric (MethodMetric.java:snapshot(118)) - Error invoking method getTransactionsSinceLastLogRoll
	java.lang.reflect.InvocationTargetException
...
Caused by: java.lang.IllegalStateException: Bad state: UNINITIALIZED
	at com.google.common.base.Preconditions.checkState(Preconditions.java:172)
	at org.apache.hadoop.hdfs.server.namenode.FSEditLog.getCurSegmentTxId(FSEditLog.java:452)
	at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getTransactionsSinceLastLogRoll(FSNamesystem.java:3488)
{panel}	
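
The first trace can be illustrated with a minimal, self-contained sketch. It assumes the metrics snapshot calls a getter reflectively and that the getter is guarded by a state check, as the frames above suggest. The class and method names (MetricSnapshotSketch, EditLogLike, snapshot) are hypothetical and are not taken from the Hadoop source.

```java
import java.lang.reflect.InvocationTargetException;
import java.lang.reflect.Method;

public class MetricSnapshotSketch {
    // Hypothetical stand-in for the edit log's lifecycle states.
    enum State { UNINITIALIZED, OPEN }

    // Hypothetical stand-in for an object that exposes a metric getter.
    public static class EditLogLike {
        private State state = State.UNINITIALIZED;
        private long curSegmentTxId = 0;

        public void open(long firstTxId) {
            curSegmentTxId = firstTxId;
            state = State.OPEN;
        }

        // Getter guarded by a state check, similar in spirit to a checkState call.
        public long getCurSegmentTxId() {
            if (state != State.OPEN) {
                throw new IllegalStateException("Bad state: " + state);
            }
            return curSegmentTxId;
        }
    }

    // Invokes the getter reflectively, as a metrics snapshot might, and returns
    // either the value or the message of the underlying failure.
    public static String snapshot(EditLogLike log) {
        try {
            Method getter = EditLogLike.class.getMethod("getCurSegmentTxId");
            return String.valueOf(getter.invoke(log));
        } catch (InvocationTargetException e) {
            // Reflection wraps the real exception; unwrap it to see the cause.
            return "Error invoking method: " + e.getCause().getMessage();
        } catch (ReflectiveOperationException e) {
            return "Reflection failure: " + e;
        }
    }

    public static void main(String[] args) {
        EditLogLike log = new EditLogLike();
        System.out.println(snapshot(log)); // metric read before initialization
        log.open(42L);
        System.out.println(snapshot(log)); // metric read after initialization
    }
}
```

In this sketch the failure depends only on ordering: the same getter succeeds once the object has been opened, which matches a metric source being registered and sampled before the edit log is initialized.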

It then tries to enter the active state, which fails.

{panel}
2012-11-01 12:59:43,445 FATAL namenode.NameNode (NameNode.java:doImmediateShutdown(1291)) - Error encountered requiring NN shutdown. Shutting down immediately.
java.lang.NullPointerException
	at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startActiveServices(FSNamesystem.java:645)
	at org.apache.hadoop.hdfs.server.namenode.NameNode$NameNodeHAContext.startActiveServices(NameNode.java:1315)
	at org.apache.hadoop.hdfs.server.namenode.ha.ActiveState.enterState(ActiveState.java:61)
{panel}

This is caught by ExitUtil and the test fails. 
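
As a rough illustration of why an immediate shutdown turns into a test failure, the sketch below assumes the test harness disables real JVM exits so that a shutdown request surfaces as an exception instead. All names here (ExitInterceptSketch, its ExitException, startNode) are hypothetical and only loosely modeled on the behavior described above. This is not the actual ExitUtil code.

```java
public class ExitInterceptSketch {
    // Hypothetical exception thrown in place of a real JVM exit during tests.
    public static class ExitException extends RuntimeException {
        public final int status;

        public ExitException(int status, String message) {
            super(message);
            this.status = status;
        }
    }

    private static volatile boolean systemExitDisabled = false;

    // Tests call this so that a shutdown request cannot kill the test JVM.
    public static void disableSystemExit() {
        systemExitDisabled = true;
    }

    // Exits the JVM, or throws when exits have been disabled.
    public static void terminate(int status, String message) {
        if (systemExitDisabled) {
            throw new ExitException(status, message);
        }
        System.exit(status);
    }

    // Hypothetical node startup that hits a fatal error and requests shutdown.
    public static void startNode() {
        try {
            throw new NullPointerException("service not initialized");
        } catch (NullPointerException e) {
            terminate(1, "Error encountered requiring shutdown: " + e.getMessage());
        }
    }

    public static void main(String[] args) {
        disableSystemExit();
        try {
            startNode();
        } catch (ExitException e) {
            // In a unit test this exception propagates and fails the test.
            System.out.println("exit intercepted, status=" + e.status);
        }
    }
}
```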
                
> TestBackupNode.testCheckpointNode fails due to a backup node startup issue.
> ---------------------------------------------------------------------------
>
>                 Key: HDFS-4138
>                 URL: https://issues.apache.org/jira/browse/HDFS-4138
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha, name-node
>    Affects Versions: 2.0.3-alpha
>            Reporter: Kihwal Lee
>
> When a backup node is starting up, it tries to enter the active state and start common
> services. It then fails to start the services and exits, which is caught by the exit util.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
