spark-reviews mailing list archives

From jiangxb1987
Subject [GitHub] spark issue #20414: [SPARK-23243][SQL] Shuffle+Repartition on an RDD could l...
Date Mon, 29 Jan 2018 07:40:46 GMT
Github user jiangxb1987 commented on the issue:
    @felixcheung You are right, I didn't make it clear that there would still be many shuffle
blocks, and that if the read task is retried it should be slower than using `repartition(1)`.
    I now lean toward fixing the issue with the latter approach, fixing the shuffle fetch order,
since it may resolve the problem in the general case.
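
    To illustrate why the fetch order matters, here is a minimal sketch in plain Python rather than Spark code. The `blocks` dictionary, `read_blocks`, and `round_robin` are hypothetical names made up for this example, and sorting by block id is only an assumed stand-in for whatever ordering an actual fix would use. It shows a round-robin assignment giving different partition contents when a retried read sees the same blocks in a different order, and the same contents once the order is fixed.

```python
from itertools import chain

# Hypothetical shuffle blocks keyed by block id (illustration only, not Spark's API).
blocks = {0: ["a", "b"], 1: ["c"], 2: ["d", "e"]}


def read_blocks(fetch_order, fix_order=False):
    """Concatenate block contents in fetch order, or in sorted block-id order."""
    order = sorted(fetch_order) if fix_order else fetch_order
    return list(chain.from_iterable(blocks[b] for b in order))


def round_robin(records, num_partitions):
    """Assign records to partitions one at a time, in arrival order."""
    partitions = [[] for _ in range(num_partitions)]
    for i, rec in enumerate(records):
        partitions[i % num_partitions].append(rec)
    return partitions


# Same blocks, different arrival order between the first attempt and a retry.
first_attempt = round_robin(read_blocks([0, 1, 2]), 2)
retry = round_robin(read_blocks([2, 0, 1]), 2)
print(first_attempt != retry)  # the two attempts disagree

# With a fixed order, both attempts produce the same partitions.
fixed_first = round_robin(read_blocks([0, 1, 2], fix_order=True), 2)
fixed_retry = round_robin(read_blocks([2, 0, 1], fix_order=True), 2)
print(fixed_first == fixed_retry)
```

    This is only a toy model of the problem; it says nothing about how the change would be implemented inside Spark's shuffle reader.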

