hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Junping Du (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2013) The diagnostics is always the ExitCodeException stack when the container crashes
Date Tue, 08 Jul 2014 02:12:36 GMT

    [ https://issues.apache.org/jira/browse/YARN-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14054428#comment-14054428

Junping Du commented on YARN-2013:

[~gtCarrera9], I reopen YARN-2242 as we agreed to address RM/NM side separately. Let's do
an improved patch on that jira. 
[~ozawa], Thanks for the patch here which is in good direction. Do you think we should do
similar thing with LinuxContainerExecutor? If so, please add. Also, I think it is better to
add some unit test (i.e. add in TestContainerLaunch.java) to verify messages.

> The diagnostics is always the ExitCodeException stack when the container crashes
> --------------------------------------------------------------------------------
>                 Key: YARN-2013
>                 URL: https://issues.apache.org/jira/browse/YARN-2013
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>            Reporter: Zhijie Shen
>            Assignee: Tsuyoshi OZAWA
>         Attachments: YARN-2013.1.patch, YARN-2013.2.patch, YARN-2013.3-2.patch, YARN-2013.3.patch
> When a container crashes, ExitCodeException will be thrown from Shell. Default/LinuxContainerExecutor
captures the exception, put the exception stack into the diagnostic. Therefore, the exception
stack is always the same. 
> {code}
>         String diagnostics = "Exception from container-launch: \n"
>             + StringUtils.stringifyException(e) + "\n" + shExec.getOutput();
>         container.handle(new ContainerDiagnosticsUpdateEvent(containerId,
>             diagnostics));
> {code}
> In addition, it seems that the exception always has a empty message as there's no message
from stderr. Hence the diagnostics is not of much use for users to analyze the reason of container

This message was sent by Atlassian JIRA

View raw message