spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Josh Rosen (JIRA)" <>
Subject [jira] [Updated] (SPARK-9144) Remove DAGScheduler.runLocallyWithinThread and spark.localExecution.enabled
Date Fri, 17 Jul 2015 22:59:04 GMT


Josh Rosen updated SPARK-9144:
    Issue Type: Improvement  (was: New Feature)

> Remove DAGScheduler.runLocallyWithinThread and spark.localExecution.enabled
> ---------------------------------------------------------------------------
>                 Key: SPARK-9144
>                 URL:
>             Project: Spark
>          Issue Type: Improvement
>          Components: Scheduler, Spark Core
>            Reporter: Josh Rosen
>            Assignee: Josh Rosen
> Spark has an option called {{spark.localExecution.enabled}}; according to the docs:
> {quote}
> Enables Spark to run certain jobs, such as first() or take() on the driver, without sending
tasks to the cluster. This can make certain jobs execute very quickly, but may require shipping
a whole partition of data to the driver.
> {quote}
> This feature ends up adding quite a bit of complexity to DAGScheduler, especially in
the {{runLocallyWithinThread}} method, but as far as I know nobody uses this feature (I searched
the mailing list and haven't seen any recent mentions of the configuration nor stacktraces
including the runLocally method).  As a step towards scheduler complexity reduction, I propose
that we remove this feature and all code related to it for Spark 1.5. 

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message