hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Junping Du (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (YARN-4325) Purge app state from NM state-store should cover more LOG_HANDLING cases
Date Fri, 22 Apr 2016 15:33:13 GMT

     [ https://issues.apache.org/jira/browse/YARN-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Junping Du updated YARN-4325:
-----------------------------
    Attachment: ApplicationImpl.gv

> Purge app state from NM state-store should cover more LOG_HANDLING cases
> ------------------------------------------------------------------------
>
>                 Key: YARN-4325
>                 URL: https://issues.apache.org/jira/browse/YARN-4325
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 2.6.0
>            Reporter: Junping Du
>            Assignee: Junping Du
>            Priority: Critical
>         Attachments: ApplicationImpl
>
>
> From a long running cluster, we found tens of thousands of stale apps still be recovered
in NM restart recovery. The reason is some wrong configuration setting to log aggregation
so the end of log aggregation events are not received so stale apps are not purged properly.
We should make sure the removal of app state to be independent of log aggregation life cycle.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message