hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mayank Bansal (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2055) Preemption: Jobs are failing due to AMs are getting launched and killed multiple times
Date Fri, 16 May 2014 10:42:16 GMT

    [ https://issues.apache.org/jira/browse/YARN-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13998941#comment-13998941
] 

Mayank Bansal commented on YARN-2055:
-------------------------------------

YARN-2022 is for avoiding killing AM however this issue more like how we are launching AM
after preemption as there would be situations where you get some capacity for one heart beat
and then again that capacity is reclaimed by other queue and then again AM will be killed
and job will be failed. Based on the comments of YARN-2022 i dont see this case have been
handeled there.

Thanks,
Mayank

> Preemption: Jobs are failing due to AMs are getting launched and killed multiple times
> --------------------------------------------------------------------------------------
>
>                 Key: YARN-2055
>                 URL: https://issues.apache.org/jira/browse/YARN-2055
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>    Affects Versions: 2.4.0
>            Reporter: Mayank Bansal
>
> If Queue A does not have enough capacity to run AM, then AM will borrow capacity from
queue B to run AM in that case AM will be killed if queue B will reclaim its capacity and
again AM will be launched and killed again, in that case job will be failed.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message