hadoop-yarn-issues mailing list archives

From "Naganarasimha G R (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3995) Some of the NM events are not getting published due race condition when AM container finishes in NM
Date Thu, 30 Jul 2015 01:13:05 GMT

    [ https://issues.apache.org/jira/browse/YARN-3995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14647017#comment-14647017 ]

Naganarasimha G R commented on YARN-3995:
-----------------------------------------

Two approaches have been discussed so far:
# We can have a timer task that periodically cleans up the collector after some period, rather
than removing it immediately when the AM container finishes.
# When the RM finishes the attempt, it can send a finish event through the timeline client for
the ApplicationEntity. This event acts as a marker that the NM's TimelineCollectorManager
can act upon.
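
A minimal sketch of the first approach, assuming a simplified in-memory registry. The class `DelayedCollectorRegistry`, its methods, and the linger period are hypothetical names used only for illustration; they are not the actual TimelineCollectorManager API. The idea is that the AM-finished callback only marks the collector with an expiry time, and a periodic sweep removes it later, so late events from this NM or other NMs can still be accepted in the meantime.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical sketch of delayed collector cleanup; not the real YARN classes. */
class DelayedCollectorRegistry {
  // How long a collector lingers after the AM container finishes (assumed value).
  private final long lingerMillis;
  // appId -> events accepted so far (a stand-in for a real per-app collector).
  private final Map<String, List<String>> collectors = new ConcurrentHashMap<>();
  // appId -> timestamp after which the collector may be removed.
  private final Map<String, Long> expiry = new ConcurrentHashMap<>();

  DelayedCollectorRegistry(long lingerMillis) {
    this.lingerMillis = lingerMillis;
  }

  void register(String appId) {
    collectors.putIfAbsent(appId, new ArrayList<>());
  }

  /** Called when the AM container finishes: mark for removal, do not remove yet. */
  void amContainerFinished(String appId, long nowMillis) {
    if (collectors.containsKey(appId)) {
      expiry.putIfAbsent(appId, nowMillis + lingerMillis);
    }
  }

  /** Returns false if the collector for the app has already been removed. */
  boolean publish(String appId, String event) {
    List<String> events = collectors.get(appId);
    if (events == null) {
      return false;
    }
    synchronized (events) {
      events.add(event);
    }
    return true;
  }

  /** Body of the periodic timer task: remove collectors whose linger period has passed. */
  int sweep(long nowMillis) {
    int removed = 0;
    for (Map.Entry<String, Long> e : expiry.entrySet()) {
      if (e.getValue() <= nowMillis) {
        collectors.remove(e.getKey());
        expiry.remove(e.getKey());
        removed++;
      }
    }
    return removed;
  }
}
```

In a real NM, `sweep(System.currentTimeMillis())` could be scheduled with something like `ScheduledExecutorService.scheduleAtFixedRate`. The time is passed in explicitly here only to keep the sketch deterministic. The second approach would replace the expiry map with an explicit removal triggered by the RM's finish marker.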



> Some of the NM events are not getting published due race condition when AM container finishes in NM
> ----------------------------------------------------------------------------------------------------
>
>                 Key: YARN-3995
>                 URL: https://issues.apache.org/jira/browse/YARN-3995
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager, timelineserver
>    Affects Versions: YARN-2928
>            Reporter: Naganarasimha G R
>            Assignee: Naganarasimha G R
>
> As discussed in YARN-3045: while testing with TestDistributedShell, we found that a few
> of the container metrics events were failing because of a race condition. When the AM
> container finishes and removes the collector for the app, it is still possible that
> events published for the app by the current NM and other NMs are still in the pipeline,




