hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Weiwei Yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-11936) Ozone: TestNodeManager times out before it is able to find all nodes
Date Wed, 19 Jul 2017 14:45:01 GMT

    [ https://issues.apache.org/jira/browse/HDFS-11936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093192#comment-16093192
] 

Weiwei Yang commented on HDFS-11936:
------------------------------------

Hi [~yuanbo]

Thanks for working on this. I have some comment with this patch. It looks like you narrow
down the problem to {{SCMNodeManager#waitForHeartbeatProcessed}}, this check gives no guarantee
that all heartbeats are processed. Can we fix this method as it is also called some other
places, in case they may have similar problems. I am thinking if this can be fixed by

{code}
public boolean waitForHeartbeatProcessed() {
    return this.heartbeatQueue.isEmpty();
}
{code} 

Please let me know your thought. thanks.

> Ozone: TestNodeManager times out before it is able to find all nodes
> --------------------------------------------------------------------
>
>                 Key: HDFS-11936
>                 URL: https://issues.apache.org/jira/browse/HDFS-11936
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: ozone
>    Affects Versions: HDFS-7240
>            Reporter: Anu Engineer
>            Assignee: Yuanbo Liu
>         Attachments: HDFS-11936-HDFS-7240.001.patch
>
>
> During the pre-commit build of https://builds.apache.org/job/PreCommit-HDFS-Build/19795/testReport/
> we detected that a test in TestNodeManager is failing. Probably due to the
> fact that we need more time to execute this test in jenkins. This might be related to
HDFS-11919
> The test failure report follows.
> ==========================================================================
> {noformat}
> Regression
> org.apache.hadoop.ozone.scm.node.TestNodeManager.testScmStatsFromNodeReport
> Failing for the past 1 build (Since Failed#19795 )
> Took 0.51 sec.
> Error Message
> expected:<20000> but was:<18000>
> Stacktrace
> java.lang.AssertionError: expected:<20000> but was:<18000>
> 	at org.junit.Assert.fail(Assert.java:88)
> 	at org.junit.Assert.failNotEquals(Assert.java:743)
> 	at org.junit.Assert.assertEquals(Assert.java:118)
> 	at org.junit.Assert.assertEquals(Assert.java:555)
> 	at org.junit.Assert.assertEquals(Assert.java:542)
> 	at org.apache.hadoop.ozone.scm.node.TestNodeManager.testScmStatsFromNodeReport(TestNodeManager.java:972)
> Standard Output
> 2017-06-06 13:45:30,909 [main] INFO       - Data node with ID: 732ebd32-a926-44c5-afbb-c9f87513a67c
Registered.
> 2017-06-06 13:45:30,937 [main] INFO       - Data node with ID: 6860fd5d-94dc-4ba8-acd0-41cc3fa7232d
Registered.
> 2017-06-06 13:45:30,971 [main] INFO       - Data node with ID: cad7174c-204c-4806-b3af-c874706d4bd9
Registered.
> 2017-06-06 13:45:30,996 [main] INFO       - Data node with ID: 0130a672-719d-4b68-9a1e-13046f4281ff
Registered.
> 2017-06-06 13:45:31,021 [main] INFO       - Data node with ID: 8d9ea5d4-6752-48d4-9bf0-adb0bd1a651a
Registered.
> 2017-06-06 13:45:31,046 [main] INFO       - Data node with ID: f122e372-5a38-476b-97dc-5ae449190485
Registered.
> 2017-06-06 13:45:31,071 [main] INFO       - Data node with ID: 5750eb03-c1ac-4b3a-bc59-c4d9481e245b
Registered.
> 2017-06-06 13:45:31,097 [main] INFO       - Data node with ID: aa2d90a1-9e85-41f8-a4e5-35c7d2ed7299
Registered.
> 2017-06-06 13:45:31,122 [main] INFO       - Data node with ID: 5e52bf5c-7050-4fc9-bf10-0e52650229ee
Registered.
> 2017-06-06 13:45:31,147 [main] INFO       - Data node with ID: eaac7b8f-a556-4afc-9163-7309f7ccea18
Registered.
> 2017-06-06 13:45:31,224 [SCM Heartbeat Processing Thread - 0] INFO       - Current Thread
is interrupted, shutting down HB processing thread for Node Manager.
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org


Mime
View raw message