spark-issues mailing list archives

From "Michelangelo D'Agostino (JIRA)" <>
Subject [jira] [Commented] (SPARK-1006) MLlib ALS gets stack overflow with too many iterations
Date Wed, 05 Nov 2014 23:08:34 GMT


Michelangelo D'Agostino commented on SPARK-1006:

Are there any plans to work on this, or any pointers on how one would go about making the needed modification?
I'm working with a dataset that doesn't appear to converge before it runs into this.

> MLlib ALS gets stack overflow with too many iterations
> ------------------------------------------------------
>                 Key: SPARK-1006
>                 URL:
>             Project: Spark
>          Issue Type: Bug
>          Components: MLlib
>            Reporter: Matei Zaharia
> The tipping point seems to be around 50 iterations. We should fix this by checkpointing the RDDs
> every 10-20 iterations to break the lineage chain, but checkpointing currently requires HDFS
> to be installed, which not all users will have.
> We might also be able to fix DAGScheduler to not be recursive.
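
The checkpointing idea in the description above can be illustrated with a toy model in plain Python. This is only a sketch and not Spark or MLlib code: `LazyDataset`, `iterate`, and `checkpoint_interval` are hypothetical names invented for the illustration, and the overflow here comes from Python's recursion limit rather than from DAGScheduler. In Spark itself the corresponding calls would presumably be `SparkContext.setCheckpointDir` and `RDD.checkpoint()`, applied every few iterations inside the ALS loop.

```python
class LazyDataset:
    """Toy stand-in for an RDD (hypothetical, not a Spark class).

    A node holds either materialized data or a parent plus a function,
    so a long chain of map() calls forms a long lineage.
    """

    def __init__(self, data=None, parent=None, fn=None):
        self.data = data
        self.parent = parent
        self.fn = fn

    def map(self, fn):
        # Lazy: only records the step, nothing is computed yet.
        return LazyDataset(parent=self, fn=fn)

    def collect(self):
        # Recursive evaluation: at least one stack frame per lineage step.
        if self.data is not None:
            return list(self.data)
        return [self.fn(x) for x in self.parent.collect()]

    def checkpoint(self):
        # Materialize the values and drop the parent link,
        # which truncates the lineage chain at this node.
        self.data = self.collect()
        self.parent = None
        self.fn = None
        return self

    def lineage_depth(self):
        # Count the unevaluated steps behind this node (iteratively).
        depth, node = 0, self
        while node.parent is not None:
            depth += 1
            node = node.parent
        return depth


def iterate(num_iterations, checkpoint_interval=None):
    """Apply one lazy update per iteration, optionally checkpointing."""
    ds = LazyDataset(data=[1.0, 2.0, 3.0])
    for i in range(1, num_iterations + 1):
        ds = ds.map(lambda x: x * 0.5 + 1.0)
        if checkpoint_interval and i % checkpoint_interval == 0:
            ds.checkpoint()
    return ds


if __name__ == "__main__":
    unbroken = iterate(5000)
    print("lineage without checkpoints:", unbroken.lineage_depth())
    try:
        unbroken.collect()
    except RecursionError:
        print("deep lineage overflowed the stack")

    broken = iterate(5000, checkpoint_interval=10)
    print("lineage with checkpoints:", broken.lineage_depth())
    print(broken.collect())
```

With a checkpoint every 10 iterations the lineage never grows past 10 steps, so evaluation stays shallow no matter how many iterations run. The toy keeps checkpointed data in memory; Spark writes checkpoints to the configured checkpoint directory, which is where the HDFS requirement mentioned above comes from.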
