cassandra-commits mailing list archives

From "Nate McCall (JIRA)" <j...@apache.org>
Subject [jira] Commented: (CASSANDRA-1520) stress.py's multiget option sends increasingly inefficient queries as more test data is inserted
Date Mon, 20 Sep 2010 18:31:33 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12912627#action_12912627 ]

Nate McCall commented on CASSANDRA-1520:
----------------------------------------

1520v2 looks good as well. The new -g usage works well for finding the slice sweet spot.
For the record: I had to apply the patch to the file as it was before the commit, so some
merge handjiving will be needed.

> stress.py's multiget option sends increasingly inefficient queries as more test data is inserted
> ------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-1520
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1520
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Nate McCall
>            Assignee: Brandon Williams
>            Priority: Minor
>             Fix For: 0.7 beta 2
>
>         Attachments: 1520-v2.txt, 1520.patch
>
>
> MultiGetter's key list sizes should be broken up better for more efficient queries. Setting
> an initial value that breaks up the key list into N sub lists (where N is the number of threads)
> yielded more efficient queries. (The choice of thread count here was a stop-gap for demonstration
> purposes. End result should probably be a chunk-size config option with a sane default).
> Pre patch:
> ---
> python stress.py -o multiget -t 25 -n 250000 -c 5
> total,interval_op_rate,interval_key_rate,avg_latency,elapsed_time
> 6,0,6000,8.6109764576,10
> 10,0,4000,18.6666852832,20
> 17,0,7000,27.4705835751,30
> 23,0,6000,36.6091703971,41
> 25,0,2000,41.8415510654,42
> Post patch:
> ---
> python mstress.py -o multiget -t 25 -n 250000 -c 5
> total,interval_op_rate,interval_key_rate,avg_latency,elapsed_time
> 172,17,6880,1.44215503127,10
> 314,14,5680,1.8667214538,20
> 466,15,6080,1.69888155084,31
> 624,15,6320,1.55442555947,41
> 625,0,40,0.0914790630341,41
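
For illustration, the chunking described in the issue can be sketched roughly as follows. This is not the code from 1520.patch or 1520-v2.txt; the helper name chunk_keys and the assumption that each thread starts with an equal share of the keys are hypothetical. The figures are at least consistent with the output above: pre patch, 25 operations cover 250000 keys (10000 keys per multiget); post patch, 625 operations cover them (400 keys per multiget).

```python
def chunk_keys(keys, num_chunks):
    """Split keys into contiguous sub-lists, at most num_chunks of them.

    Hypothetical helper: the name and signature are assumptions for
    illustration, not taken from stress.py or the attached patches.
    """
    if num_chunks < 1:
        raise ValueError("num_chunks must be at least 1")
    # Ceiling division, so no more than num_chunks sub-lists are produced.
    size = max(1, -(-len(keys) // num_chunks))
    return [keys[i:i + size] for i in range(0, len(keys), size)]


# Values from the command line in the issue: -t 25 -n 250000
threads = 25
total_keys = 250000

# Assumption: each thread is handed an equal share of the keys.
keys_per_thread = total_keys // threads
thread_keys = ["%06d" % i for i in range(keys_per_thread)]

# One multiget per sub-list instead of one multiget for the whole share.
chunks = chunk_keys(thread_keys, threads)
total_requests = threads * len(chunks)

print(keys_per_thread, len(chunks), len(chunks[0]), total_requests)
```

A configurable chunk size, as the issue suggests for the end result, would replace num_chunks with a fixed slice length and leave the rest unchanged.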

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

