hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Payne (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-5889) Improve user-limit calculation in capacity scheduler
Date Fri, 03 Feb 2017 22:41:51 GMT

    [ https://issues.apache.org/jira/browse/YARN-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15852243#comment-15852243
] 

Eric Payne commented on YARN-5889:
----------------------------------

Thanks [~sunilg] and [~leftnoteasy] for your work on this feature. I do have one concern.
I think there is a race condition where if a container fails, the freed resources are not
recorded for that user about half the time.

Use Case:
- Queue is 50% of cluster
- MULP = 50%
- One app fills the cluster
- Some containers fail
-- I simulate this by using {{yarn container -signal container_1486159534159_0004_01_000029
 FORCEFUL_SHUTDOWN}}
- The app is only given new containers about half the time.
-- That is to say, the app is asking for resources, and the cluster has free space, but the
app is not being given those resources.

I'm sorry I can't go into more detail at this time. I just discovered this issue, and I have
not had time to investigate further. However, since you are about to complete the work on
this JIRA, I felt I should provide the information I have so far.

> Improve user-limit calculation in capacity scheduler
> ----------------------------------------------------
>
>                 Key: YARN-5889
>                 URL: https://issues.apache.org/jira/browse/YARN-5889
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: capacity scheduler
>            Reporter: Sunil G
>            Assignee: Sunil G
>         Attachments: YARN-5889.0001.patch, YARN-5889.0001.suggested.patchnotes, YARN-5889.0002.patch,
YARN-5889.0003.patch, YARN-5889.0004.patch, YARN-5889.0005.patch, YARN-5889.0006.patch, YARN-5889.0007.patch,
YARN-5889.0008.patch, YARN-5889.0009.patch, YARN-5889.0010.patch, YARN-5889.v0.patch, YARN-5889.v1.patch,
YARN-5889.v2.patch
>
>
> Currently user-limit is computed during every heartbeat allocation cycle with a write
lock. To improve performance, this tickets is focussing on moving user-limit calculation out
of heartbeat allocation flow.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message