Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 17412 invoked from network); 14 Oct 2005 03:27:36 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 14 Oct 2005 03:27:36 -0000 Received: (qmail 92234 invoked by uid 500); 14 Oct 2005 03:27:34 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 92217 invoked by uid 500); 14 Oct 2005 03:27:33 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 92203 invoked by uid 99); 14 Oct 2005 03:27:32 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [192.87.106.226] (HELO ajax.apache.org) (192.87.106.226) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Oct 2005 20:27:30 -0700 Received: from ajax.apache.org (ajax.apache.org [127.0.0.1]) by ajax.apache.org (Postfix) with ESMTP id F2DD5515 for ; Fri, 14 Oct 2005 05:27:08 +0200 (CEST) Message-ID: <1834584985.1129260428836.JavaMail.jira@ajax.apache.org> Date: Fri, 14 Oct 2005 05:27:08 +0200 (CEST) From: "Yonik Seeley (JIRA)" To: java-dev@lucene.apache.org Subject: [jira] Updated: (LUCENE-454) lazily create SegmentMergeInfo.docMap In-Reply-To: <2131957529.1129175166466.JavaMail.jira@ajax.apache.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N [ http://issues.apache.org/jira/browse/LUCENE-454?page=all ] Yonik Seeley updated LUCENE-454: -------------------------------- Attachment: docMap.txt Also deferred creation of SegmentMergeInfo.postings (TermPositions) for another 15% gain. Same index and query were used to test, but this time 100,000 query iterations. defer docMap only: indexing time = 121,734 ms querying time = 18,266 ms defer docMap and postings: indexing time = 120,860 ms querying time = 15,625 ms > lazily create SegmentMergeInfo.docMap > ------------------------------------- > > Key: LUCENE-454 > URL: http://issues.apache.org/jira/browse/LUCENE-454 > Project: Lucene - Java > Type: Improvement > Versions: CVS Nightly - Specify date in submission > Reporter: Yonik Seeley > Attachments: docMap.txt, docMap.txt > > Since creating the docMap is expensive, and it's only used during segment merging, not searching, defer creation until it is requested. > SegmentMergeInfo is also used in MultiTermEnum, the term enumerator for a MultiReader. TermEnum is used by queries such as PrefixQuery, RangeQuery, WildcardQuery, as well as RangeFilter, DateFilter, and sorting the first time (filling the FieldCache). > Performance Results: > A simple single field index with 555,555 documents, and 1000 random deletions was queried 1000 times with a PrefixQuery matching a single document. > Performance Before Patch: > indexing time = 121,656 ms > querying time = 58,812 ms > Performance After Patch: > indexing time = 121,000 ms > querying time = 598 ms > A 100 fold increase in query performance! > All lucene unit tests pass. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org