cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matthew F. Dennis (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CASSANDRA-4894) log number of combined/merged rows during a compaction
Date Thu, 01 Nov 2012 19:15:13 GMT
Matthew F. Dennis created CASSANDRA-4894:
--------------------------------------------

             Summary: log number of combined/merged rows during a compaction
                 Key: CASSANDRA-4894
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4894
             Project: Cassandra
          Issue Type: Improvement
            Reporter: Matthew F. Dennis
            Priority: Critical


we already log some details about compactions but it would be useful to know how many rows
were merged (resulting in "useful" work) and how many were unique (representing "wasted work").

the simple approach requires two additional counters (one for unique rows, one for merged
rows).  As the merge join is progressing if two rows are combined, tick the joined counter.
 If a row is simply copied tick the unique counter.

a more complete solution would be to keep a separate count for each number of merges.  This
would require number_of_files_being_merged counters.  If no rows were merged, tick counters[0],
if two rows were merged tick counters[1] and so on 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message