Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 72377 invoked from network); 11 Feb 2007 18:10:28 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 11 Feb 2007 18:10:28 -0000 Received: (qmail 84156 invoked by uid 500); 11 Feb 2007 18:10:35 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 83492 invoked by uid 500); 11 Feb 2007 18:10:34 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 83481 invoked by uid 99); 11 Feb 2007 18:10:34 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 11 Feb 2007 10:10:34 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 11 Feb 2007 10:10:25 -0800 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id BB7477142FB for ; Sun, 11 Feb 2007 10:10:05 -0800 (PST) Message-ID: <27435975.1171217405765.JavaMail.jira@brutus> Date: Sun, 11 Feb 2007 10:10:05 -0800 (PST) From: "Grant Ingersoll (JIRA)" To: java-dev@lucene.apache.org Subject: [jira] Assigned: (LUCENE-790) contrib/benchmark - few improvements and a bug fix In-Reply-To: <9943138.1170315845514.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/LUCENE-790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned LUCENE-790: -------------------------------------- Assignee: Grant Ingersoll (was: Doron Cohen) > contrib/benchmark - few improvements and a bug fix > -------------------------------------------------- > > Key: LUCENE-790 > URL: https://issues.apache.org/jira/browse/LUCENE-790 > Project: Lucene - Java > Issue Type: Improvement > Components: Other > Affects Versions: 2.1 > Reporter: Doron Cohen > Assigned To: Grant Ingersoll > Priority: Minor > Fix For: 2.1 > > Attachments: TrecDocMaker.patch > > > Benchmark byTask was slightly improved: > 1. fixed a bug in the "child-should-not-report" mechanism. If a task sequence contained only simple tasks it worked as expected (i.e. child tasks did not report times/memory) but if a child was a task sequence, then its children would report - they should not - this was fixed, so this property is now "penetrating/inherited" all the way down. > 2. doc size control now possible also for the Reuters doc maker. (allowing to index N docs of size C characters each.) > 3. TrecDocMaker was added - it reads as input the .gz files used in Trec - e.g. .gov data - this can be handy to benchmark Lucene on these large collections. Similar to the Reuters collection, the doc-maker scans the input directory for all the files and extracts documents from the files. Here there are multiple documents in each input file. Unlike the Reuters collection, we cannot provide a 'loader' for these collections - they are available from http://trec.nist.gov - for research purposes. > 4. a new BasicDocMaker abstract class handles most of doc-maker tasks, including creating docs with specific size, so adding new doc-makers for other data is now much simpler. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org