cassandra-user mailing list archives

From DuyHai Doan <>
Subject Re: Are aggregate functions done in parallel?
Date Thu, 28 Jan 2016 17:36:04 GMT
You can read this: and this:

Long story short, UDF and UDA computation in Cassandra is not distributed.
All the values are first retrieved on the coordinator node (to apply the
last-write-wins reconciliation logic) before any UDF/UDA is applied.
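
To illustrate the point above, here is a toy Python sketch of that flow. It
is not Cassandra code: the names reconcile and coordinator_sum, the replica
dictionaries, and the integer timestamps are all made up for illustration.
It only assumes that each replica returns a value and a write timestamp per
row, that the coordinator keeps the newest write, and that the aggregate
runs on the merged rows afterwards.

```python
def reconcile(replica_responses):
    """Merge replica responses, keeping the newest write for each row key.

    Each response maps a (hypothetical) row key to (value, write_timestamp).
    """
    newest = {}
    for response in replica_responses:
        for key, (value, write_ts) in response.items():
            if key not in newest or write_ts > newest[key][1]:
                newest[key] = (value, write_ts)
    return {key: value for key, (value, _) in newest.items()}


def coordinator_sum(replica_responses):
    """Aggregate only after every row has been gathered and reconciled."""
    rows = reconcile(replica_responses)
    return sum(rows.values())


# Two replicas disagree on row ("sensor1", 1); the write at timestamp 200 wins.
replica_a = {("sensor1", 1): (10, 100), ("sensor1", 2): (20, 100)}
replica_b = {("sensor1", 1): (15, 200), ("sensor1", 2): (20, 100)}

total = coordinator_sum([replica_a, replica_b])
print(total)  # 35
```

In this model the sum cannot be pushed down to the replicas, because a
replica holding a stale value would contribute the wrong number; all rows
have to reach the coordinator first.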

The sweet spot for Cassandra UDAs is single-partition operations. If you
need to aggregate across multiple partitions, consider using Apache Spark.

On Thu, Jan 28, 2016 at 6:06 PM, Francisco Reyes <> wrote:

> Does Cassandra parallelize aggregate functions?
> Have a new project with potentially 200 to 300 million rows per month that
> I need to do aggregates on. Wondering if Cassandra would be a good match.
