lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Muir <rcm...@gmail.com>
Subject Re: Sorting with little memory: A suggestion
Date Fri, 19 Mar 2010 15:18:05 GMT
On Fri, Mar 19, 2010 at 11:06 AM, Toke Eskildsen <te@statsbiblioteket.dk> wrote:
>
> To me, the trade-offs seems to be
>
> new Sort(new SortField(field, locale))

Toke, only partially-on-topic here, is it possible to describe your
use-case a little more where its preferable to use this Locale-based
sort instead of indexing collation keys (e.g. you have to support so
many locales this would be too much indexing overhead?)

For reference, I tried to give some various tricks for using the
Collation package here (admittedly for Solr, but the same principles
apply): http://wiki.apache.org/solr/UnicodeCollation

Separately, if there is some reason you really need to use the
Locale-based runtime sort, perhaps we should rethink ways to allow the
ICU collator to be used instead of the JDK one, somehow.

For key generation speed and indexing size its only slightly faster
for us, but for runtime comparisons, considerably more so:
http://site.icu-project.org/charts/collation-icu4j-sun

-- 
Robert Muir
rcmuir@gmail.com

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message