spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Susan X. Huynh (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-21419) Support Mesos failover_timeout in driver (Mesos cluster mode)
Date Fri, 14 Jul 2017 15:16:00 GMT
Susan X. Huynh created SPARK-21419:
--------------------------------------

             Summary: Support Mesos failover_timeout in driver (Mesos cluster mode)
                 Key: SPARK-21419
                 URL: https://issues.apache.org/jira/browse/SPARK-21419
             Project: Spark
          Issue Type: Improvement
          Components: Mesos
    Affects Versions: 2.2.0
            Reporter: Susan X. Huynh


In Mesos cluster mode, the driver framework's failover_timeout is currently set to zero. This
means that if the driver temporarily loses connectivity with the master, the driver is considered
disconnected, and the master will immediately kill all tasks and executors associated with
the framework.

To avoid this behavior, I would like to make this failover_timeout configurable. A user could
then set it to a non-zero value, so that during a temporary disconnection the master would
wait before tearing down the framework.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message