hadoop-yarn-issues mailing list archives

From "Rohith Sharma K S (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-4989) TestWorkPreservingRMRestart#testCapacitySchedulerRecovery fails intermittently
Date Fri, 22 Apr 2016 09:59:12 GMT

    [ https://issues.apache.org/jira/browse/YARN-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15253667#comment-15253667 ]

Rohith Sharma K S commented on YARN-4989:

In the test {{TestWorkPreservingRMRestart#testCapacitySchedulerRecovery}}, after the RM is restarted,
{{waitForNumContainersToRecover}} is called for the submitted apps. Three apps are submitted, but the
test waits for only two of them, i.e. am1_1 and am1_2. The third AM, *am2*, also needs to wait for
container recovery. The code for that wait is there, but it waits on am1_2 a second time.

    // Wait for RM to settle down on recovering containers;
    waitForNumContainersToRecover(2, rm2, am1_1.getApplicationAttemptId());
    waitForNumContainersToRecover(2, rm2, am1_2.getApplicationAttemptId());
    waitForNumContainersToRecover(2, rm2, am1_2.getApplicationAttemptId());

In the third {{waitForNumContainersToRecover}} call, passing am2 instead of am1_2 should fix the
intermittent failure.
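
The proposed change can be illustrated with a minimal, self-contained sketch. It does not use the real MockRM/MockAM classes: the class {{RecoveryWaitSketch}}, the method {{waitForAllAttempts}}, and the string attempt IDs are hypothetical stand-ins, and the stubbed {{waitForNumContainersToRecover}} only records which attempt was waited on instead of polling a scheduler.

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class RecoveryWaitSketch {
    // Hypothetical stand-in for the test helper: it records the attempt
    // that was waited on rather than polling a real ResourceManager.
    static final Set<String> waitedAttempts = new LinkedHashSet<>();

    static void waitForNumContainersToRecover(int num, String attemptId) {
        waitedAttempts.add(attemptId);
    }

    // Corrected call sequence: the third call targets am2 instead of
    // repeating am1_2, so every submitted app's attempt is waited on.
    static Set<String> waitForAllAttempts() {
        waitedAttempts.clear();
        waitForNumContainersToRecover(2, "am1_1");
        waitForNumContainersToRecover(2, "am1_2");
        waitForNumContainersToRecover(2, "am2");
        return waitedAttempts;
    }

    public static void main(String[] args) {
        // Prints the distinct attempts that were waited on.
        System.out.println(waitForAllAttempts());
    }
}
```

With the original sequence the set would contain only two entries, which is the gap that lets the queue check run before am2's containers are recovered.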

> TestWorkPreservingRMRestart#testCapacitySchedulerRecovery fails intermittently 
> -------------------------------------------------------------------------------
>                 Key: YARN-4989
>                 URL: https://issues.apache.org/jira/browse/YARN-4989
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: test
>            Reporter: Rohith Sharma K S
> Sometimes TestWorkPreservingRMRestart#testCapacitySchedulerRecovery fails randomly.
> {noformat}
> java.lang.AssertionError: expected:<<memory:2048, vCores:2>> but was:<<memory:1024,
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.failNotEquals(Assert.java:743)
> at org.junit.Assert.assertEquals(Assert.java:118)
> at org.junit.Assert.assertEquals(Assert.java:144)
> at org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.checkCSLeafQueue(TestWorkPreservingRMRestart.java:289)
> at org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testCapacitySchedulerRecovery(TestWorkPreservingRMRestart.java:501)
> {noformat}
