Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 22832 invoked from network); 11 Jun 2009 18:31:25 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 11 Jun 2009 18:31:25 -0000 Received: (qmail 45459 invoked by uid 500); 11 Jun 2009 18:31:36 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 45388 invoked by uid 500); 11 Jun 2009 18:31:36 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 45375 invoked by uid 99); 11 Jun 2009 18:31:31 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Jun 2009 18:31:31 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Jun 2009 18:31:28 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 8C37F234C004 for ; Thu, 11 Jun 2009 11:31:07 -0700 (PDT) Message-ID: <112866383.1244745067560.JavaMail.jira@brutus> Date: Thu, 11 Jun 2009 11:31:07 -0700 (PDT) From: "Michael McCandless (JIRA)" To: java-dev@lucene.apache.org Subject: [jira] Commented: (LUCENE-1539) Improve Benchmark In-Reply-To: <1346552103.1234291740340.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/LUCENE-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718578#action_12718578 ] Michael McCandless commented on LUCENE-1539: -------------------------------------------- Right, I think deleteDocsByPercent should 1) determine how many docs to delete (deletePct * reader.numDocs()), and then 2) random select ones to delete, counting how many actually were deleted, and stopping when it reaches the target. To avoid this taking excessively long when too many deletions are requested, you should probably invert if the %tg is > 50? Ie, choose instead the docs NOT to delete, and then make a linear sweep to delete any docs not chosen? > Improve Benchmark > ----------------- > > Key: LUCENE-1539 > URL: https://issues.apache.org/jira/browse/LUCENE-1539 > Project: Lucene - Java > Issue Type: Improvement > Components: contrib/benchmark > Affects Versions: 2.4 > Reporter: Jason Rutherglen > Assignee: Michael McCandless > Priority: Minor > Fix For: 2.9 > > Attachments: LUCENE-1539.patch, LUCENE-1539.patch, LUCENE-1539.patch, LUCENE-1539.patch, sortBench2.py, sortCollate2.py > > Original Estimate: 336h > Remaining Estimate: 336h > > Benchmark can be improved by incorporating recent suggestions posted > on java-dev. M. McCandless' Python scripts that execute multiple > rounds of tests can either be incorporated into the codebase or > converted to Java. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org