lucene-dev mailing list archives

From "Adrien Grand (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4792) Smaller doc maps
Date Sun, 24 Feb 2013 21:08:12 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13585490#comment-13585490 ]

Adrien Grand commented on LUCENE-4792:
--------------------------------------

In case someone would like to use this class, I'd add that:
 - the encoded sequence does not strictly need to be monotonic: it can encode any sequence
of values, but it compresses best when the stream consists of monotonic sub-sequences of at
least 1024 longs (for example, it would have a good compression ratio on 10000 increasing
values followed by 5000 decreasing values),
 - it can address up to 2^42 values,
 - there are writer/reader equivalents called MonotonicBlockPackedWriter and MonotonicBlockPackedReader
(which can either load values in memory or read from disk).
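
To illustrate why monotonic sub-sequences compress well, here is a minimal, self-contained
sketch of the general idea: within each block, subtract a straight line through the first and
last values and count the bits needed for what is left over. This is not the actual Lucene
code. The class name MonotonicBlockSketch and its methods are hypothetical, and the real
classes may differ in details such as how the slope is computed and how values are packed.

```java
// Hypothetical sketch, NOT Lucene's implementation: it only estimates how many
// bits per value a block needs once a linear trend has been removed from it.
class MonotonicBlockSketch {

    /** Bits per value needed for values[from, to) after subtracting a linear trend. */
    static int bitsPerValue(long[] values, int from, int to) {
        int len = to - from;
        if (len <= 1) {
            return 0;
        }
        // Average slope of the block, taken from its first and last values.
        double avg = (double) (values[to - 1] - values[from]) / (len - 1);
        long min = Long.MAX_VALUE;
        long max = Long.MIN_VALUE;
        for (int i = 0; i < len; i++) {
            long expected = values[from] + (long) (avg * i);
            long delta = values[from + i] - expected;
            min = Math.min(min, delta);
            max = Math.max(max, delta);
        }
        // Deltas are stored relative to min, so only their range matters.
        // (Overflow of max - min is ignored in this sketch.)
        long range = max - min;
        return range == 0 ? 0 : 64 - Long.numberOfLeadingZeros(range);
    }

    /** Sum of the estimated payload bits over consecutive blocks of blockSize values. */
    static long totalBits(long[] values, int blockSize) {
        long total = 0;
        for (int from = 0; from < values.length; from += blockSize) {
            int to = Math.min(values.length, from + blockSize);
            total += (long) bitsPerValue(values, from, to) * (to - from);
        }
        return total;
    }

    /** Test data: n increasing values (0..n-1) followed by n decreasing values (n-1..0). */
    static long[] upThenDown(int n) {
        long[] values = new long[2 * n];
        for (int i = 0; i < n; i++) {
            values[i] = i;
            values[n + i] = n - 1 - i;
        }
        return values;
    }

    public static void main(String[] args) {
        long[] values = upThenDown(1024);
        // Each 1024-value block is monotonic, so the trend removes everything.
        System.out.println("blocks of 1024: " + totalBits(values, 1024) + " bits");
        // A single block spanning the turning point cannot be fit by one line.
        System.out.println("one block of 2048: " + totalBits(values, 2048) + " bits");
    }
}
```

In this sketch the direction of each block does not matter, only that the block is close to
a straight line, which matches the remark above that an increasing run followed by a
decreasing run still compresses well as long as each run covers whole blocks.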
                
> Smaller doc maps
> ----------------
>
>                 Key: LUCENE-4792
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4792
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>             Fix For: 4.2
>
>         Attachments: LUCENE-4792.patch
>
>
> MergeState.DocMap could leverage MonotonicAppendingLongBuffer to save memory.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

