hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Billie Rinaldi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-7894) Improve ATS response for DS_CONTAINER when container launch fails
Date Sun, 06 May 2018 21:50:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-7894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16465300#comment-16465300
] 

Billie Rinaldi commented on YARN-7894:
--------------------------------------

Thanks for the patch, [~csingh]! I think patch 2 looks good overall, with a couple of minor
comments. Please move the DIAGNOSTICS string to the beginning of the class next to the other
timeline-related constants. Also, I think the diagnostics should come at the end of the application
failure message (e.g. "Application Failure: desired = {}, completed = {}, allocated = {},
failed = {}, diagnostics = {}"), since the diagnostics might be very long, and this entire
application failure string should be used in the unregisterApplicationMaster call instead
of only the diagnostics.

Have you tried an application with many container failures to see how a string with very long
diagnostics would look in the UI?

> Improve ATS response for DS_CONTAINER when container launch fails
> -----------------------------------------------------------------
>
>                 Key: YARN-7894
>                 URL: https://issues.apache.org/jira/browse/YARN-7894
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: timelineserver
>            Reporter: Charan Hebri
>            Assignee: Chandni Singh
>            Priority: Major
>         Attachments: YARN-7894.001.patch, YARN-7894.002.patch
>
>
> When a distributed shell application starts running and a container launch fails the
web service call to the API,
> {noformat}
> http://<RM web address>/ws/v1/timeline/DS_CONTAINER/<container_attempt_id>{noformat}
> return a "Not Found". The message returned in this case should be improved to signify
that a container launch failed.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message