hadoop-yarn-issues mailing list archives

From "Carlo Curino (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-7402) Federation: Global Queues
Date Fri, 27 Oct 2017 00:21:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-7402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16221498#comment-16221498 ]

Carlo Curino commented on YARN-7402:

This started with conversations with Bill Ramsey, [~roniburd], [~subru], [~asuresh], [~kkaranasos]
and [~chris.douglas].
The goal is to extend YARN's ability to enforce global invariants across a federated cluster,
while retaining the scalability of federation. For this purpose, information is shared among
sub-clusters only on heartbeats, and is limited to a highly summarized view of the world
(queue-level aggregates only).

> Federation: Global Queues
> -------------------------
>                 Key: YARN-7402
>                 URL: https://issues.apache.org/jira/browse/YARN-7402
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: federation
>            Reporter: Carlo Curino
>            Assignee: Carlo Curino
> YARN Federation today requires manual configuration of queues within each sub-cluster,
> and each RM operates "in isolation". This has a few issues:
> # Preemption is computed locally (and might far exceed the global need)
> # Jobs within a queue are forced to consume their resources "evenly" based on queue mapping
> This umbrella JIRA tracks a new feature that leverages the FederationStateStore as a
> synchronization mechanism among RMs, and allows allocation and preemption decisions to
> be based on a (close to up-to-date) global view of the cluster allocation and demand.
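
To illustrate the first issue above (locally computed preemption exceeding the global need), here is a minimal sketch in Java. It assumes each sub-cluster publishes one queue-level aggregate per queue, containing only the queue's guaranteed and used resources. The class GlobalQueueSketch, the QueueSummary type, and both methods are hypothetical names made up for this example; they are not part of the FederationStateStore API or of any YARN code, and the real design may compute things differently.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch; none of these names come from the YARN code base. */
class GlobalQueueSketch {

    /** Queue-level aggregate that one sub-cluster might publish on a heartbeat (assumed shape). */
    static final class QueueSummary {
        final String subCluster;
        final String queue;
        final long guaranteed; // resources guaranteed to the queue in this sub-cluster
        final long used;       // resources the queue currently holds in this sub-cluster

        QueueSummary(String subCluster, String queue, long guaranteed, long used) {
            this.subCluster = subCluster;
            this.queue = queue;
            this.guaranteed = guaranteed;
            this.used = used;
        }
    }

    /** Over-guarantee usage when each RM looks only at its own sub-cluster. */
    static long localOverage(List<QueueSummary> reports, String queue) {
        long total = 0L;
        for (QueueSummary r : reports) {
            if (r.queue.equals(queue)) {
                total += Math.max(0L, r.used - r.guaranteed);
            }
        }
        return total;
    }

    /** Over-guarantee usage when the summaries are first summed across sub-clusters. */
    static long globalOverage(List<QueueSummary> reports, String queue) {
        long used = 0L;
        long guaranteed = 0L;
        for (QueueSummary r : reports) {
            if (r.queue.equals(queue)) {
                used += r.used;
                guaranteed += r.guaranteed;
            }
        }
        return Math.max(0L, used - guaranteed);
    }

    public static void main(String[] args) {
        List<QueueSummary> reports = new ArrayList<>();
        // Queue A is skewed: over its guarantee in sc1, under it in sc2.
        reports.add(new QueueSummary("sc1", "A", 50, 80));
        reports.add(new QueueSummary("sc2", "A", 50, 20));

        System.out.println("local view:  " + localOverage(reports, "A"));  // prints 30
        System.out.println("global view: " + globalOverage(reports, "A")); // prints 0
    }
}
```

In this example the RM of sc1, acting in isolation, sees queue A as 30 units over its guarantee and could consider preempting it, while the summed view shows the queue exactly at its global guarantee. Only two numbers per queue per sub-cluster are exchanged, which is in the spirit of the queue-level aggregates mentioned in the comment.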

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org
