hadoop-yarn-issues mailing list archives

From "Wangda Tan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-4045) Negative avaialbleMB is being reported for root queue.
Date Fri, 21 Aug 2015 18:00:46 GMT

    [ https://issues.apache.org/jira/browse/YARN-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14707158#comment-14707158 ]

Wangda Tan commented on YARN-4045:
----------------------------------

Hi [~lichangleo],
I think the comprehensive fix is that, when we allocate from a reserved container, we should
also check the limits of the queue hierarchy. Currently the logic reserves without checking
limits, which does not seem correct to me.

If we find that queue limits would be violated by allocating the reserved container, we
should drop the reservation. Thoughts?
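
To make the proposal concrete, here is a minimal sketch of the check, not the actual CapacityScheduler code. The names QueueNode, ReservationCheck, fitsInHierarchy and allocateFromReservation are hypothetical, and each queue is assumed to carry a single memory limit in MB.

```java
/** Hypothetical queue node; NOT the real ParentQueue/LeafQueue classes. */
class QueueNode {
    final String name;
    final QueueNode parent; // null for the root queue
    final long limitMB;     // assumed per-queue memory limit
    long usedMB;            // memory currently accounted to this queue

    QueueNode(String name, QueueNode parent, long limitMB, long usedMB) {
        this.name = name;
        this.parent = parent;
        this.limitMB = limitMB;
        this.usedMB = usedMB;
    }
}

class ReservationCheck {
    /** True if adding containerMB keeps every queue from leaf to root within its limit. */
    static boolean fitsInHierarchy(QueueNode leaf, long containerMB) {
        for (QueueNode q = leaf; q != null; q = q.parent) {
            if (q.usedMB + containerMB > q.limitMB) {
                return false;
            }
        }
        return true;
    }

    /**
     * Tries to turn a reservation into an allocation.
     * Returns false without changing any usage if a limit would be violated;
     * the caller is then expected to drop the reservation.
     */
    static boolean allocateFromReservation(QueueNode leaf, long containerMB) {
        if (!fitsInHierarchy(leaf, containerMB)) {
            return false;
        }
        // Charge the container to the leaf and to every ancestor up to root.
        for (QueueNode q = leaf; q != null; q = q.parent) {
            q.usedMB += containerMB;
        }
        return true;
    }
}
```

For example, with a root limit of 5316608 MB (the cluster size in the log below) and an illustrative root usage of 5316000 MB, a reserved 1536 MB container would be rejected rather than pushing root over its limit.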

Thanks,
Wangda

> Negative avaialbleMB is being reported for root queue.
> ------------------------------------------------------
>
>                 Key: YARN-4045
>                 URL: https://issues.apache.org/jira/browse/YARN-4045
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 2.7.1
>            Reporter: Rushabh S Shah
>
> We recently deployed 2.7 in one of our clusters.
> We are seeing negative availableMB being reported for queue=root.
> This is from the jmx output:
> {noformat}
> <clusterMetrics>
>     ...
>     <availableMB>-163328</availableMB>
>     ...
> </clusterMetrics>
> {noformat}
> The following is the RM log:
> {noformat}
> 2015-08-10 14:42:28,280 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:28,404 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:30,913 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:30,913 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:33,093 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:33,093 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:35,548 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:35,549 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:39,088 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:39,089 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:39,338 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:39,339 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:39,757 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:39,758 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:43,056 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:43,070 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:44,486 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:44,487 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:44,886 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:44,886 [ResourceManager Event Processor] INFO capacity.ParentQueue:
assignedContainer queue=root usedCapacity=1.0032743 absoluteUsedCapacity=1.0032743 used=<memory:5334016,
vCores:6212> cluster=<memory:5316608, vCores:28320>
> 2015-08-10 14:42:47,401 [ResourceManager Event Processor] INFO capacity.ParentQueue:
completedContainer queue=root usedCapacity=1.0029854 absoluteUsedCapacity=1.0029854 used=<memory:5332480,
vCores:6202> cluster=<memory:5316608, vCores:28320>
> {noformat}
> bq.  used=<memory:5332480, vCores:6202> cluster=<memory:5316608, vCores:28320>
> For the root queue, usedCapacity is more than totalCapacity.
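
The over-commit in the quoted log line can be checked with a few lines of arithmetic. This is only an illustration: the class and method names (RootUsage, overcommitMB, usedCapacity) are hypothetical, and the inputs are the used and cluster memory values from the log.

```java
/** Hypothetical helper for checking the numbers in the RM log; not part of YARN. */
class RootUsage {
    /** Memory used beyond the cluster total, in MB (positive means over-committed). */
    static long overcommitMB(long usedMB, long clusterMB) {
        return usedMB - clusterMB;
    }

    /** Ratio of used memory to cluster memory. */
    static double usedCapacity(long usedMB, long clusterMB) {
        return (double) usedMB / clusterMB;
    }
}
```

With used=5332480 and cluster=5316608, overcommitMB gives 15872 and usedCapacity gives about 1.0029854, which agrees with the usedCapacity value logged for queue=root.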



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
