lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <>
Subject [jira] [Commented] (LUCENE-4527) CompressingStoredFieldsFormat: encode numStoredFields more efficiently
Date Sun, 04 Nov 2012 12:24:12 GMT


Robert Muir commented on LUCENE-4527:

And of course for this test (and any other test of compressing stored fields), that 
"fake" wikipedia corpus being used is totally invalid because all documents are 
truncated to a specific length :)
> CompressingStoredFieldsFormat: encode numStoredFields more efficiently
> ----------------------------------------------------------------------
>                 Key: LUCENE-4527
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>             Fix For: 4.1
>         Attachments: LUCENE-4527.patch
> Another interesting idea from Robert: many applications have a schema and all documents
are likely to have the same number of stored fields. We could save space by using packed ints
and the same kind of optimization as {{ForUtil}} (requiring only one VInt if all values are

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message