db-derby-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bryan Pendleton (JIRA)" <j...@apache.org>
Subject [jira] Updated: (DERBY-4363) Add simple benchmark for measuring GROUP BY performance
Date Sun, 30 Aug 2009 16:21:37 GMT

     [ https://issues.apache.org/jira/browse/DERBY-4363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Bryan Pendleton updated DERBY-4363:

    Attachment: simpleBenchmark.diff

Attached 'simpleBenchmark.diff' is a patch proposal for a new benchmark
in the o.a.dT.perf.clients package.

This benchmark runs 3 GROUP BY statements against the scalable
Wisconsin table TENKTUP1. The statements partition the entire table
into (a) 10 groups, (b) 100 groups, and (c) 1000 groups.

Sample ways to run this benchmark:
1)  java org.apache.derbyTesting.perf.clients.Runner -load group_by -init

     initializes the database and runs the benchmark against the 10,000 row
     default Wisconsin database.

2) java org.apache.derbyTesting.perf.clients.Runner -load group_by -init -load_opts numRows=100000

      Reloads the DB and runs the benchmark against a 100,000 row table.

3) java org.apache.derbyTesting.perf.clients.Runner -load group_by -init -load_opts numRows=1000000
-rt 300

    Reloads the DB and runs the benchmark against a 1,000,000 row table. Since
    the runtimes are much slower against a giant table, lets the benchmark run 5 minutes.

Comments and suggestions would be wonderful.

> Add simple benchmark for measuring GROUP BY performance
> -------------------------------------------------------
>                 Key: DERBY-4363
>                 URL: https://issues.apache.org/jira/browse/DERBY-4363
>             Project: Derby
>          Issue Type: Sub-task
>          Components: Test, Tools
>            Reporter: Bryan Pendleton
>            Assignee: Bryan Pendleton
>            Priority: Minor
>         Attachments: simpleBenchmark.diff
> As part of ROLLUP implementation (DERBY-3002), it will be helpful to be able to measure
the performance of GROUP BY.
> Using the o.a.dT.perf.clients framework, this sub-task proposes to add a GroupByClient
to the performance runner
> library; the GroupByClient will run GROUP BY statements against the Wisconsin benchmark

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message