hadoop-yarn-issues mailing list archives

From "Krishna Kishore Bonagiri (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (YARN-541) getAllocatedContainers() is not returning all the allocated containers
Date Tue, 16 Apr 2013 08:07:17 GMT

     [ https://issues.apache.org/jira/browse/YARN-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Krishna Kishore Bonagiri updated YARN-541:

    Attachment: yarn-dsadm-resourcemanager-isredeng.out

Hi Hitesh,

  I am attaching the logs for the AM, RM, and NM. I have an application that
runs in a loop and requires 5 containers per run. The 8th run failed with
this getAllocatedContainers() issue: the Application Master could not get
all 5 containers it required, because getAllocatedContainers() returned
only 4. The RM's log, however, says that the 5th container was also
allocated, through this message:

2013-04-16 03:32:54,701 INFO  [ResourceManager Event Processor]
rmcontainer.RMContainerImpl (RMContainerImpl.java:handle(220)) -
container_1366096597608_0008_01_000006 Container Transitioned from NEW to

In the RM's log, you can see this kind of message for the remaining 4 containers
also, i.e. container_1366096597608_0008_01_000002 to

Also, as I said before, this issue occurs randomly.
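
For readers following this thread, the workaround mentioned in the issue description (asking the RM again for whatever is still missing, without releasing the containers already held) can be sketched roughly as below. This is only an illustration under stated assumptions: `Allocator`, `collect`, and `shortFirstResponse` are hypothetical names, the RM is simulated in-process, and nothing here uses or mirrors the actual YARN client API or explains why the RM behaves this way.

```java
import java.util.ArrayList;
import java.util.List;

public class AllocationLoopSketch {

    /** Hypothetical stand-in for the RM side of an allocate call; not a YARN type. */
    interface Allocator {
        // Returns the containers handed out in this response,
        // given how many containers the caller still needs.
        List<String> allocate(int stillNeeded);
    }

    /**
     * Accumulates containers over several responses until 'required' are held
     * or 'maxAttempts' responses have been consumed. Containers already held
     * are kept, and only the shortfall is requested again.
     */
    static List<String> collect(Allocator rm, int required, int maxAttempts) {
        List<String> held = new ArrayList<>();
        for (int attempt = 0; attempt < maxAttempts && held.size() < required; attempt++) {
            int shortfall = required - held.size();
            held.addAll(rm.allocate(shortfall));
        }
        return held;
    }

    /** Simulated RM (an assumption) that returns one container fewer than asked on its first response. */
    static Allocator shortFirstResponse() {
        return new Allocator() {
            private int next = 2;          // ids start at 2, loosely like the ..._000002 ids in the log
            private boolean first = true;

            @Override
            public List<String> allocate(int stillNeeded) {
                int n = first ? Math.max(0, stillNeeded - 1) : stillNeeded;
                first = false;
                List<String> out = new ArrayList<>();
                for (int i = 0; i < n; i++) {
                    out.add(String.format("container_%06d", next++));
                }
                return out;
            }
        };
    }

    public static void main(String[] args) {
        // Ask for 5 containers, allowing up to 3 responses.
        List<String> held = collect(shortFirstResponse(), 5, 3);
        System.out.println(held.size() + " containers: " + held);
    }
}
```

With a single attempt the simulated AM ends up one container short, which is the symptom reported here; allowing a second request fills the gap.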


> getAllocatedContainers() is not returning all the allocated containers
> ----------------------------------------------------------------------
>                 Key: YARN-541
>                 URL: https://issues.apache.org/jira/browse/YARN-541
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.0.3-alpha
>         Environment: Redhat Linux 64-bit
>            Reporter: Krishna Kishore Bonagiri
>         Attachments: AppMaster.stdout, yarn-dsadm-nodemanager-isredeng.out, yarn-dsadm-resourcemanager-isredeng.out
> I am running an application that was written for, and works well with, hadoop-2.0.0-alpha,
> but when I run the same application against 2.0.3-alpha, the getAllocatedContainers() method called
> on AMResponse sometimes does not return all the allocated containers. For example, I request
> 10 containers and this method sometimes gives me only 9, yet when I look
> at the Resource Manager's log, the 10th container is also allocated. It happens only occasionally,
> at random, and works fine all other times. If I send one more request to the RM for the remaining container
> after it failed to give it the first time (and before releasing the already acquired ones),
> the RM could allocate that container. I am running only one application at a time, but 1000s of
> them one after another.
> My main worry is that, even though the RM's log says that all 10 requested containers
> are allocated, the getAllocatedContainers() method does not return all of them; surprisingly, it returned
> only 9. I never saw this kind of issue in the previous version, i.e. hadoop-2.0.0-alpha.
> Thanks,
> Kishore

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
