hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Templeton (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-7150) Yarn crash [max number of completed apps kept in memory met]
Date Sun, 03 Sep 2017 14:27:02 GMT

    [ https://issues.apache.org/jira/browse/YARN-7150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151825#comment-16151825
] 

Daniel Templeton commented on YARN-7150:
----------------------------------------

That's expected behavior.  YARN has a configurable maximum number of jobs it will keep in
memory and recover from the state store after a failure.  Once that limit is exceeded, you
can find the applications that are no longer available from the resource manager in the application
history server.  Is there some other issue other than the log about hitting the max number
of jobs?  If not, I'll close this as not an issue.

> Yarn crash [max number of completed apps kept in memory met]
> ------------------------------------------------------------
>
>                 Key: YARN-7150
>                 URL: https://issues.apache.org/jira/browse/YARN-7150
>             Project: Hadoop YARN
>          Issue Type: Bug
>         Environment: Production
>            Reporter: anikad ayman
>
> During MapReduce processing of several jobs, Yarn did crash and the processing of jobs
had stopped.
> I successed to back the processing after killing jobs which were running (2jobs).
> In the logs I find from the beginning of the crash :
> {code:java}
> *Max number of completed apps kept in state store met*: maxCompletedAppsInStateStore
= 10000, removing app application_1500982512144_26754 from state store.
>  2017-08-25 03:50:05,799 INFO org.apache.hadoop.yarn.server.resourcemanager.RMAppManager:
*Application should be expired, max number of completed apps kept in memory met*: maxCompletedAppsInMemory
= 10000, removing app application_1500982512144_26754 from memory
> {code}
> After that , this message shows up several times in the log :
> {code:java}
> Large response size 4742320 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications
> {code}
> Have you any explication and solution of this issue ?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message