spark-user mailing list archives

From shahid qadri <shahidashr...@icloud.com>
Subject repartition vs partitionby
Date Sat, 17 Oct 2015 07:32:45 GMT
Hi folks

I need to repartition a large data set (around 300 GB), because I see that some partitions
hold much more data than others (data skew).

I have pair RDDs of the form [({},{}),({},{}),({},{})].

What is the best way to solve this problem?

