spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Noorul Islam Kamal Malmiyoda <>
Subject Cassandra read throughput using DataStax connector in Spark
Date Sat, 26 Dec 2015 15:37:57 GMT
Hello all,

I am using DataStax connector to read data from Cassandra and write to
another Cassandra cluster.  Infra is Amazon. I have three nodes
cluster with replication factor of 3 on both clusters.

But the throughput seems to be very low. It takes 7 minutes to
transfer around 2.5 GB/node. I think the bottleneck is at the read
side as I could see that spark node (Independent of two clusters) is
less loaded with respect to memory and CPU.

I tried tweaking some from

Do you have any idea whether there is any parameter that I can tweak
to get better throughput?


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message