lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Toke Eskildsen (JIRA)" <>
Subject [jira] [Updated] (LUCENE-2369) Locale-based sort by field with low memory overhead
Date Thu, 14 Jul 2011 10:51:00 GMT


Toke Eskildsen updated LUCENE-2369:

    Attachment: LUCENE-2369.patch

Maintenance patch bringing the code up to date with Lucene trunk@1145556 (2011-07-13). Patch
and test by e.g.

svn co lucene-2369
cd lucene-2369
patch -p0 < LUCENE-2369.patch
cd modules/queryparser/
cd ../../lucene/contrib/exposed/
ant compile-test
cd ../..
java -cp lib/junit-4.7.jar:build/classes/test/:build/classes/java:build/contrib/exposed/classes/java/:build/contrib/exposed/classes/test/

> Locale-based sort by field with low memory overhead
> ---------------------------------------------------
>                 Key: LUCENE-2369
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: core/search
>            Reporter: Toke Eskildsen
>            Priority: Minor
>              Labels: memory, search, sort, time
>         Attachments: LUCENE-2369.patch, LUCENE-2369.patch, LUCENE-2369.patch, LUCENE-2369.patch,
LUCENE-2369.patch, lucene-2369-20101011.patch
> The current implementation of locale-based sort in Lucene uses the FieldCache which keeps
all sort terms in memory. Beside the huge memory overhead, searching requires comparison of
terms with every time, making searches with millions of hits fairly expensive.
> This proposed alternative implementation is to create a packed list of pre-sorted ordinals
for the sort terms and a map from document-IDs to entries in the sorted ordinals list. This
results in very low memory overhead and faster sorted searches, at the cost of increased startup-time.
As the ordinals can be resolved to terms after the sorting has been performed, this approach
supports fillFields=true.
> This issue is related to which contain
previous discussions on the subject.

This message is automatically generated by JIRA.
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message