incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From John Pyeatt <>
Subject nodetool repair seems to increase linearly with number of keyspaces
Date Wed, 20 Nov 2013 12:53:48 GMT
We have an application that has been designed to use potentially 100s of
keyspaces (one for each company).

One thing we are noticing is that nodetool repair across all of the
keyspaces seems to increase linearly based on the number of keyspaces. For
example, if we have a 6 node ec2 (m1.large) cluster across 3 Availability
Zones and create 20 keyspaces a nodetool repair -pr on one node takes 3
hours even with no data in any of the keyspaces. If I bump that up to 40
keyspaces it takes 6 hours.

Is this the behaviour you would expect?

Is there anything you can think of (short of redesigning the cluster to
limit keyspaces) to increase the performance of the nodetool repairs?

My obvious concern is that as this application grows and we get more
companies using our it we will eventually have too many keyspaces to
perform repairs on the cluster.

John Pyeatt
Singlewire Software, LLC

View raw message