cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Schuller <>
Subject Re: CommitLog replay
Date Tue, 21 Jun 2011 13:12:44 GMT
> I’ve got a single node deployment of 0.8 set up on my windows box. When I
> insert a bunch of data into it, the commitlogs directory doesn’t clear upon
> completion (should it?).

It is expected that commit logs are retained for a while, and that
there is reply going on when restarting a node. The main way to ensure
that a smaller amount of commit log is active at any given moment, is
to ensure that all column familes are flushed sufficiently often. This
is because when column families are flushed, they are no longer
necessitating the retention of the commit logs that contain the writes
that were just flushed.

Pay attention to whether you maybe have some cf:s that are written
very rarely and won't flush until the flush expiry period.

> As a result, when I stop and restart Cassandra it
> replays all the commitlogs, then starts compacting (which seems like it’s
> taking a long time). While it’s compacting it won’t talk to my test client.

That it starts compacting is expected if the data flushed as a result
of the commit log reply triggers compactions. However, compaction does
not imply that the node refuses to talk to clients.

Are you sure the node has fully started? it should log when it starts
up the thrift interface - check system.log.

/ Peter Schuller

View raw message