hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Haibo Chen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706
Date Thu, 03 Aug 2017 18:53:01 GMT

    [ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113296#comment-16113296

Haibo Chen commented on YARN-6920:

Hmm.. It still does not fix TestNMClient for me.  Is this the same failure that you have seen?
"java.lang.AssertionError: Exception is not expected: org.apache.hadoop.yarn.exceptions.YarnException:
Cannot perform RE_INIT on [container_1501785504205_0001_01_000001]. Current state is [REINITIALIZING,
Looking at the log, it looks like NM cannot find the pid file because the container launch
context is empty, and therefore waits for up to maxKillWaitTime which is 2000. So it seems
like a timing issue to me. Also, I set some breakpoints which happened to mess up the time
on NM, and then I got the same Exception but with container being in DONE state. 

> Fix TestNMClient failure due to YARN-6706
> -----------------------------------------
>                 Key: YARN-6920
>                 URL: https://issues.apache.org/jira/browse/YARN-6920
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Arun Suresh
>            Assignee: Arun Suresh
>         Attachments: YARN-6920.001.patch, YARN-6920.002.patch, YARN-6920.003.patch, YARN-6920.004.patch
> Looks like {{TestNMClient}} has been failing for a while. Opening this JIRA to track
the fix.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org

View raw message