hadoop-yarn-issues mailing list archives

From "Akira Ajisaka (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-6860) TestRMRestart.testFinishedAppRemovalAfterRMRestart fails intermittently
Date Mon, 24 Jul 2017 09:33:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-6860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16098141#comment-16098141 ]

Akira Ajisaka commented on YARN-6860:
-------------------------------------

The test fails in the following code:
{code}
    // the first app0 get kicked out from both rmContext and state store
    Assert.assertNull(rm2.getRMContext().getRMApps()
      .get(app0.getApplicationId()));
    Assert.assertNull(rmAppState.get(app0.getApplicationId()));
{code}
RMAppManager removes app0 from rmContext through a blocking API, but removes it from the state store
through a non-blocking API (see {{RMStateStore#removeApplication}} for details). The state store entry
can therefore still be present when the second assertion runs, so that assertion may fail intermittently.
I think the issue can be fixed by waiting for the removal with {{GenericTestUtils#waitFor}}.
I'll attach a patch shortly.
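
To illustrate the idea of polling instead of asserting immediately, here is a minimal, self-contained sketch. It is not the patch itself: the {{waitFor}} method below is a hypothetical stand-in written for this example (it only mimics the check / polling interval / timeout shape of {{GenericTestUtils#waitFor}}), and the map, the "app0" key, and the class name are assumptions that simulate a state store whose removal completes asynchronously.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

public class WaitForRemovalSketch {

  // Hypothetical stand-in for GenericTestUtils#waitFor: polls the check every
  // checkEveryMillis until it returns true or waitForMillis has elapsed.
  public static void waitFor(Supplier<Boolean> check, long checkEveryMillis,
      long waitForMillis) throws TimeoutException, InterruptedException {
    long deadline = System.nanoTime() + waitForMillis * 1_000_000L;
    while (!check.get()) {
      if (System.nanoTime() >= deadline) {
        throw new TimeoutException(
            "Condition not met within " + waitForMillis + " ms");
      }
      Thread.sleep(checkEveryMillis);
    }
  }

  // Simulates a non-blocking removal that finishes after deleteDelayMillis,
  // then waits up to waitForMillis for the entry to disappear.
  // Returns true if the removal was observed, false on timeout or interrupt.
  public static boolean removedAfterAsyncDelete(long deleteDelayMillis,
      long waitForMillis) {
    // Assumed stand-in for the state store's application map.
    Map<String, String> store = new ConcurrentHashMap<>();
    store.put("app0", "RMAPP_FINISHED");

    // The removal runs on another thread, like an asynchronous store event.
    Thread remover = new Thread(() -> {
      try {
        Thread.sleep(deleteDelayMillis);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
        return;
      }
      store.remove("app0");
    });
    remover.setDaemon(true);
    remover.start();

    try {
      // An immediate "assert store.get(...) == null" could fail here;
      // polling tolerates the delay.
      waitFor(() -> store.get("app0") == null, 10, waitForMillis);
      return true;
    } catch (TimeoutException e) {
      return false;
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    }
  }

  public static void main(String[] args) {
    System.out.println(removedAfterAsyncDelete(100, 5000));
  }
}
```

In the real test, the same pattern would presumably wrap the {{rmAppState.get(app0.getApplicationId())}} check, with a timeout generous enough for slow build machines.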

> TestRMRestart.testFinishedAppRemovalAfterRMRestart fails intermittently
> -----------------------------------------------------------------------
>
>                 Key: YARN-6860
>                 URL: https://issues.apache.org/jira/browse/YARN-6860
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: test
>            Reporter: Akira Ajisaka
>            Assignee: Akira Ajisaka
>
> https://builds.apache.org/job/PreCommit-YARN-Build/16528/testReport/org.apache.hadoop.yarn.server.resourcemanager/TestRMRestart/testFinishedAppRemovalAfterRMRestart/
> {noformat}
> java.lang.AssertionError: expected null, but was:<submit_time: 1500886835534 application_submission_context { application_id { id: 1 cluster_timestamp: 1500886835515 } application_name: "" queue: "default" priority { priority: 0 } am_container_spec { } cancel_tokens_when_complete: true maxAppAttempts: 2 resource { memory: 1024 virtual_cores: 1 } applicationType: "YARN" keep_containers_across_application_attempts: false attempt_failures_validity_interval: 0 am_container_resource_request { priority { priority: 0 } resource_name: "*" capability { memory: 1024 virtual_cores: 1 } num_containers: 1 relax_locality: true node_label_expression: "" execution_type_request { execution_type: GUARANTEED enforce_execution_type: false } } } user: "jenkins" start_time: 1500886835535 application_state: RMAPP_FINISHED finish_time: 1500886835559>
> 	at org.junit.Assert.fail(Assert.java:88)
> 	at org.junit.Assert.failNotNull(Assert.java:664)
> 	at org.junit.Assert.assertNull(Assert.java:646)
> 	at org.junit.Assert.assertNull(Assert.java:656)
> 	at org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testFinishedAppRemovalAfterRMRestart(TestRMRestart.java:1673)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
