hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "s.a.rao@accenture.com (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2865) Application recovery continuously fails with "Application with id already present. Cannot duplicate"
Date Wed, 07 Sep 2016 10:15:20 GMT

    [ https://issues.apache.org/jira/browse/YARN-2865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15470223#comment-15470223
] 

s.a.rao@accenture.com commented on YARN-2865:
---------------------------------------------

Hi,

Issues with YARN were it is not becoming active  due to the below issue and we are using CDH
5.3.0

org.apache.hadoop.yarn.exceptions.YarnException: Application with id application_1470357060724_43131
is already present! Cannot add a duplicate!
                at org.apache.hadoop.yarn.ipc.RPCUtil.getRemoteException(RPCUtil.java:45)
                at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.createAndPopulateNewRMApp(RMAppManager.java:365)
                at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.recoverApplication(RMAppManager.java:310)
                at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.recover(RMAppManager.java:427)
                at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.recover(ResourceManager.java:1126)
                at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$RMActiveServices.serviceStart(ResourceManager.java:501)
                at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
                at

Please can you help us with resolving the above issue.

Thanks,
Sudhakar Rao



> Application recovery continuously fails with "Application with id already present. Cannot
duplicate"
> ----------------------------------------------------------------------------------------------------
>
>                 Key: YARN-2865
>                 URL: https://issues.apache.org/jira/browse/YARN-2865
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>            Reporter: Rohith Sharma K S
>            Assignee: Rohith Sharma K S
>            Priority: Critical
>              Labels: 2.6.1-candidate
>             Fix For: 2.7.0, 2.6.1, 3.0.0-alpha1
>
>         Attachments: YARN-2865.1.patch, YARN-2865.patch, YARN-2865.patch
>
>
> YARN-2588 handles exception thrown while transitioningToActive and reset activeServices.
But it misses out clearing RMcontext apps/nodes details and ClusterMetrics and QueueMetrics.
This causes application recovery to fail.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message