cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nate McCall (JIRA)" <j...@apache.org>
Subject [jira] Created: (CASSANDRA-1520) stress.py's multiget option sends increasingly inefficient queries as more test data is inserted
Date Sun, 19 Sep 2010 05:59:33 GMT
stress.py's multiget option sends increasingly inefficient queries as more test data is inserted
------------------------------------------------------------------------------------------------

                 Key: CASSANDRA-1520
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1520
             Project: Cassandra
          Issue Type: Bug
    Affects Versions: 0.7 beta 1
            Reporter: Nate McCall
             Fix For: 0.7 beta 2


MultiGetter's key list sizes should be broken up better for more efficient queries. Setting
an initial value that breaks up the key list into N sub lists (where N is the number of threads)
yielded more efficient queries. (The choice of thread count here was a stop-gap for demonstration
purposes. End result should probably be chunk-size config option with a sane default).

Pre patch:
---
python stress.py -o multiget -t 25 -n 250000 -c 5
total,interval_op_rate,interval_key_rate,avg_latency,elapsed_time
6,0,6000,8.6109764576,10
10,0,4000,18.6666852832,20
17,0,7000,27.4705835751,30
23,0,6000,36.6091703971,41
25,0,2000,41.8415510654,42

Post patch:
---
python mstress.py -o multiget -t 25 -n 250000 -c 5
total,interval_op_rate,interval_key_rate,avg_latency,elapsed_time
172,17,6880,1.44215503127,10
314,14,5680,1.8667214538,20
466,15,6080,1.69888155084,31
624,15,6320,1.55442555947,41
625,0,40,0.0914790630341,41

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message