lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Toke Eskildsen>
Subject RE: Sorting with little memory: A suggestion
Date Fri, 19 Mar 2010 21:42:23 GMT
From: Robert Muir []:

[Toke: Indexing collation keys only helps with the speed problem]

> I don't really understand this measurement, collation keys are
> byte[]... (although its true we don't yet encode them this way in
> flex, I think we should)

I sounds like I'm missing something here... A quick check of running 20000 random Strings
of 30 characters from a-zA-Z0-1 + 20 different national characters through Java's Collator
returned an average collatorKey-length of 175 bytes. On
it is stated that a standard sort is used, which - to my knowledge - loads the Strings into
memory. For my quick test, this means a tripling of memory usage for the sort field when indexing

Toke Eskildsen
To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message