spark-dev mailing list archives

From Tom Hubregtsen
Subject Re: Spilling when not expected
Date Wed, 29 Apr 2015 17:59:04 GMT
Hi Reynold,

It took me some time, but I have finally found that a shuffle spills
differently on the map side than on the reduce side. On the map side,
spilling to disk happens by default (through the spillToPartitionFiles call
from insertAll in ExternalSorter; I don't know yet why there is a difference
in the number of calls, though). On the reduce side, spilling (through the
maybeSpillCollection call from insertAll in ExternalSorter) is optional and
depends on the memory available to the shuffle, which is determined by
spark.shuffle.memoryFraction and the total memory available. In my case, I
was only seeing the map-side spilling, and I did not realize that it is
supposed to happen regardless of the memory settings.
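
As a rough illustration of the reduce-side budget described above, here is a
minimal sketch in plain Python (no Spark dependency). It assumes the budget is
the JVM heap size multiplied by spark.shuffle.memoryFraction and by a second
factor, spark.shuffle.safetyFraction, with defaults of 0.2 and 0.8 as I recall
them from Spark 1.x. The safety fraction, both defaults, and the function names
are assumptions that do not come from the message above, and real Spark also
divides this budget among running tasks, which the sketch ignores.

```python
def shuffle_memory_budget(heap_bytes, memory_fraction=0.2, safety_fraction=0.8):
    """Approximate bytes available to shuffle collections before spilling.

    Assumption: budget = heap * spark.shuffle.memoryFraction
                              * spark.shuffle.safetyFraction.
    The default values are recalled from Spark 1.x, not taken from the message.
    """
    return int(heap_bytes * memory_fraction * safety_fraction)


def would_spill_on_reduce_side(estimated_collection_bytes, budget_bytes):
    """Hypothetical, simplified check: spill once the in-memory collection
    would exceed the budget. Spark's real logic requests memory
    incrementally and shares it between tasks."""
    return estimated_collection_bytes > budget_bytes


if __name__ == "__main__":
    one_gib = 1024 ** 3
    budget = shuffle_memory_budget(one_gib)
    print("budget for a 1 GiB heap:", budget, "bytes")
    print("300 MiB collection spills:",
          would_spill_on_reduce_side(300 * 1024 ** 2, budget))
    print("100 MiB collection spills:",
          would_spill_on_reduce_side(100 * 1024 ** 2, budget))
```

Under these assumptions, raising spark.shuffle.memoryFraction only moves the
reduce-side threshold; it would not remove the map-side spill files described
above.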

Thanks for your help,

