hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Wangda Tan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1885) RM may not send the finished signal to some nodes where the application ran after RM restarts
Date Wed, 11 Jun 2014 17:01:07 GMT

    [ https://issues.apache.org/jira/browse/YARN-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14028016#comment-14028016
] 

Wangda Tan commented on YARN-1885:
----------------------------------

I've looked into failed tests,
1) TestNMClient, this test failure seems caused by YARN-2091,
Test cases in TestNMClient uses following code to verify exit code of COMPLETED containers
{code}
          testGetContainerStatus(container, i, ContainerState.COMPLETE,
              "Container killed by the ApplicationMaster.", Arrays.asList(
                  new Integer[] {137, 143, 0}));
{code}
But YARN-2091 added logic to make exit code reflecting the actual status, so the "killed by
ApplicationMaster" will be -105,
{code}
      if (container.hasDefaultExitCode()) {
        container.exitCode = exitEvent.getExitCode();
      }
{code}
I've filed YARN-2148 and will upload a patch soon. [~ozawa], could you help me review it?

2) TestRMAdminCLI failures
I've filed YARN-2149 and will upload a patch soon. [~kihwal], could you help me review it?


> RM may not send the finished signal to some nodes where the application ran after RM
restarts
> ---------------------------------------------------------------------------------------------
>
>                 Key: YARN-1885
>                 URL: https://issues.apache.org/jira/browse/YARN-1885
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 2.4.0
>            Reporter: Arpit Gupta
>            Assignee: Wangda Tan
>         Attachments: YARN-1885.patch, YARN-1885.patch, YARN-1885.patch, YARN-1885.patch,
YARN-1885.patch, YARN-1885.patch
>
>
> During our HA testing we have seen cases where yarn application logs are not available
through the cli but i can look at AM logs through the UI. RM was also being restarted in the
background as the application was running.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message