cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ruben Cardenal <>
Subject Little question
Date Tue, 04 Oct 2016 13:57:56 GMT


We've inherited quite a big amazon infrastructure from a company we've
purchased. It's has an ancient and obsolete implementation of services,
being the worst (and more expensive) of all of them a 5 cluster of
Cassandra (RF=3). I'm new to Cassandra, and yes, I'm doing my way
throughout docs. 

I was told that Amazon asked them a few months ago to reboot one of
their servers (it had been turned on for so long that Amazon had to make
some changes and needed it rebooted), so they had to add a new node to
the cluster. If you query nodetool as of now, it shows: 

$ nodetool ring
Note: Ownership information does not include topology, please specify a
Address DC Rack Status State Load Owns Token
141784319550391026443072753096570088105 datacenter1 rack1 Up Normal 263.06 GB 16.67% 0 datacenter1 rack1 Up Normal 253.31 GB 16.67%
28356863910078205288614550619314017621 datacenter1 rack1 Up Normal 262.12 GB 33.33%
85070591730234615865843651857942052863 datacenter1 rack1 Up Normal 264.28 GB 16.67%
113427455640312821154458202477256070484 datacenter1 rack1 Up Normal 65.15 GB 16.67%

What puzzels me is the last line. It belongs to the last added node, the
new one I talked about. While it's holding the same amount of data
(16.67%) that other 3 nodes, the Load is about 4 times lower. What does
this mean? Is that difference data that is not cleaned up, such as
TTL-expired cell or tombstoned data? 

Thanks and excuse me if I'm asking something stupid. 


View raw message