Message-ID: <4559921.209881290207014755.JavaMail.jira@thor>
Date: Fri, 19 Nov 2010 17:50:14 -0500 (EST)
From: "Toke Eskildsen (JIRA)"
To: dev@lucene.apache.org
Subject: [jira] Updated: (LUCENE-2369) Locale-based sort by field with low memory overhead

     [ https://issues.apache.org/jira/browse/LUCENE-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Toke Eskildsen updated LUCENE-2369:
-----------------------------------

    Attachment: LUCENE-2369.patch

Bugfixes and maintenance.
This patch applies against Lucene trunk revision 1036986 (the latest at the time of writing). Apply the patch in the lucene sub-folder.

> Locale-based sort by field with low memory overhead
> ---------------------------------------------------
>
>                 Key: LUCENE-2369
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2369
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Search
>            Reporter: Toke Eskildsen
>            Priority: Minor
>         Attachments: lucene-2369-20101011.patch, LUCENE-2369.patch, LUCENE-2369.patch, LUCENE-2369.patch
>
>
> The current implementation of locale-based sort in Lucene uses the FieldCache, which keeps all sort terms in memory. Besides the huge memory overhead, searching requires comparing terms with collator.compare every time, making searches with millions of hits fairly expensive.
> The proposed alternative implementation creates a packed list of pre-sorted ordinals for the sort terms and a map from document IDs to entries in the sorted ordinals list. This results in very low memory overhead and faster sorted searches, at the cost of increased startup time. As the ordinals can be resolved to terms after the sorting has been performed, this approach supports fillFields=true.
> This issue is related to https://issues.apache.org/jira/browse/LUCENE-2335, which contains previous discussions on the subject.
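
To illustrate the idea in the issue description, here is a minimal sketch in plain Java. It is not the code from the attached patch and uses no Lucene classes: the class name OrdinalSortSketch, its methods, and the assumption that every document has exactly one sort term are hypothetical, and a plain int[] stands in for the packed structure the description mentions. The point is only that collator.compare runs once, at build time, and that hits are afterwards ordered by integer comparison.

```java
import java.text.Collator;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.stream.IntStream;

/** Hypothetical illustration of sorting by pre-computed collation ordinals. */
final class OrdinalSortSketch {
    /** Distinct sort terms in collator order; index = ordinal. */
    final String[] sortedTerms;
    /** docToOrdinal[docId] = ordinal of that document's term. */
    final int[] docToOrdinal;

    /** Startup cost: the only place where the collator is used. */
    OrdinalSortSketch(String[] termByDoc, Collator collator) {
        sortedTerms = Arrays.stream(termByDoc).distinct().toArray(String[]::new);
        Arrays.sort(sortedTerms, collator);

        Map<String, Integer> ordinalOf = new HashMap<>();
        for (int i = 0; i < sortedTerms.length; i++) {
            ordinalOf.put(sortedTerms[i], i);
        }

        // A real implementation would pack these ints; a plain array is used here.
        docToOrdinal = new int[termByDoc.length];
        for (int doc = 0; doc < termByDoc.length; doc++) {
            docToOrdinal[doc] = ordinalOf.get(termByDoc[doc]);
        }
    }

    /** Search time: orders hits by comparing ints, with no collator calls. */
    int[] sortHits(int[] hits) {
        return IntStream.of(hits)
                .boxed()
                .sorted(Comparator.comparingInt((Integer doc) -> docToOrdinal[doc]))
                .mapToInt(Integer::intValue)
                .toArray();
    }

    /** Resolves a document's term after sorting (the fillFields=true case). */
    String termFor(int doc) {
        return sortedTerms[docToOrdinal[doc]];
    }

    public static void main(String[] args) {
        String[] termByDoc = {"b", "a", "c", "a"};
        OrdinalSortSketch sketch =
                new OrdinalSortSketch(termByDoc, Collator.getInstance(Locale.ENGLISH));
        for (int doc : sketch.sortHits(new int[] {0, 1, 2, 3})) {
            System.out.println(doc + " -> " + sketch.termFor(doc));
        }
    }
}
```

In this sketch the terms themselves are still held in memory as Strings; the memory saving described in the issue would come from packing the ordinals and resolving terms from the index only when they are needed.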