spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Wenchen Fan (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (SPARK-12552) Recovered driver's resource is not counted in the Master
Date Wed, 14 Jun 2017 00:15:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-12552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Wenchen Fan reassigned SPARK-12552:
-----------------------------------

    Assignee: Saisai Shao  (was: Apache Spark)

> Recovered driver's resource is not counted in the Master
> --------------------------------------------------------
>
>                 Key: SPARK-12552
>                 URL: https://issues.apache.org/jira/browse/SPARK-12552
>             Project: Spark
>          Issue Type: Bug
>          Components: Deploy, Spark Core
>    Affects Versions: 1.6.0
>            Reporter: Saisai Shao
>            Assignee: Saisai Shao
>             Fix For: 2.2.1, 2.3.0
>
>
> Currently in the implementation of Standalone Master HA, if application is submitted
as cluster mode, the resource (CPU cores and memory) of driver is not counted again when recovered
from failure, which will lead to unexpected behaviors, like more than expected executors,
negative core and memory usage in the web UI. Also the recovered application's state is always
{{WAITING}}, we have to change the state to {{RUNNING}} when fully recovered.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message