cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Norberg (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CASSANDRA-4710) High key hashing overhead for index scans when using RandomPartitioner
Date Mon, 24 Sep 2012 14:32:07 GMT
Daniel Norberg created CASSANDRA-4710:
-----------------------------------------

             Summary: High key hashing overhead for index scans when using RandomPartitioner
                 Key: CASSANDRA-4710
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4710
             Project: Cassandra
          Issue Type: Bug
            Reporter: Daniel Norberg


For a workload where the dataset is completely in ram, the md5 hashing of the keys during
index scans becomes a bottleneck for reads when using RandomPartitioner, according to profiling.

Instead performing a raw key equals check in SSTableReader.getPosition() for EQ operations
improves throughput by some 30% for my workload (moving the bottleneck elsewhere).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message