incubator-cassandra-dev mailing list archives

From "Pushkar Prasad" <pushkar.pra...@airtightnetworks.net>
Subject Slow search on secondary index
Date Thu, 14 Mar 2013 07:52:25 GMT
Hi,

 

I have the following schema in Cassandra 1.2.1:

 

+ TimeStamp
+ MACAddress
+ Data Transfer
+ LocationID
+ MacAddressCopy                // Copy of MACAddress

** PRIMARY KEY (TimeStamp, MACAddress)    // Composite key, partitioned on TimeStamp

 

There are close to 500K distinct MAC addresses and 10K timestamps, so there
are about 5 billion records in total. Each record is 50 bytes, so the total
data size is about 250 GB. All of this data is stored on a 4-node cluster
with no replication.
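
As a sanity check on these estimates, here is a small Python sketch of the
arithmetic. The variable names are only illustrative, and the per-node figure
assumes the data is spread evenly across the 4 nodes, which may not hold in
practice.

```python
# Figures taken from the description above.
mac_addresses = 500_000   # distinct MAC addresses
timestamps = 10_000       # distinct timestamps
record_bytes = 50         # approximate size of one record
nodes = 4                 # cluster size, no replication

# One record per (timestamp, MAC address) pair.
total_records = mac_addresses * timestamps
total_bytes = total_records * record_bytes
total_gb = total_bytes / 10**9          # decimal gigabytes

# Assumption: perfectly even distribution across nodes.
per_node_gb = total_gb / nodes

print(f"records: {total_records:,}")
print(f"total size: {total_gb:.1f} GB")
print(f"per node (if even): {per_node_gb:.1f} GB")
```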

 

I created a secondary index on MacAddressCopy. When I search for a
particular MAC value, I expect to get back 10K records (with different
timestamps) for that MAC address. Since the column is indexed, I expected a
quick response; instead, I get RPC timeouts and the query never returns.
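
To make the data layout concrete, here is a toy Python model of where the
rows for a single MAC address live, given that the table is partitioned on
TimeStamp. It is only a sketch: node_for() is a hypothetical placement rule
standing in for the real partitioner, the MAC value is made up, and nothing
here reflects how Cassandra actually executes the index lookup.

```python
from collections import defaultdict

NUM_TIMESTAMPS = 10_000   # from the description above
NUM_NODES = 4             # from the description above


def node_for(timestamp):
    # Hypothetical placement rule; the real partitioner hashes the
    # partition key, but any rule that spreads keys over nodes will do here.
    return timestamp % NUM_NODES


def partitions_for_mac(mac):
    """Map each node to the (TimeStamp, MACAddress) keys it holds for mac."""
    by_node = defaultdict(list)
    for ts in range(NUM_TIMESTAMPS):
        # One row per timestamp, and each timestamp is its own partition.
        by_node[node_for(ts)].append((ts, mac))
    return dict(by_node)


hits = partitions_for_mac("00:11:22:33:44:55")   # made-up MAC address
partitions_touched = sum(len(keys) for keys in hits.values())
nodes_touched = len(hits)

print(f"partitions holding a matching row: {partitions_touched}")
print(f"nodes holding a matching row: {nodes_touched}")
```

In this model the 10K matching rows sit in 10K different partitions spread
over all 4 nodes, so no single partition or node can answer the query alone.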

 

Is there any reason why this should be so slow? Are too many disk seeks
causing the timeouts? Is fetching 10K records asking for too much?

 

- Pushkar

