cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Ellis (Commented) (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-4023) Batch reading BloomFilters on startup
Date Thu, 08 Mar 2012 22:35:58 GMT


Jonathan Ellis commented on CASSANDRA-4023:

We added the multithreadedness specifically because it *improves* startup time for people
with multiple spindles or SSDs...

Any ideas how to get the best of both worlds besides falling back to a config options?

(Maybe it's time to add random vs sequential speed ratio as a setting, which at least is general
enough to be useful in other places.)
> Batch reading BloomFilters on startup
> -------------------------------------
>                 Key: CASSANDRA-4023
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Joaquin Casares
>              Labels: datastax_qa
> The difference of startup times between a 0.8.7 cluster and 1.0.7 cluster with the same
amount of data is 4x greater in 1.0.7.
> It seems as though 1.0.7 loads the BloomFilter through a series of reading longs out
in a multithreaded process while 0.8.7 reads the entire object.
> Perhaps we should update the new BloomFilter to do reading in batch as well?

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message