lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Adrien Grand (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-4161) Make PackedInts usable by codecs
Date Thu, 21 Jun 2012 13:42:42 GMT
Adrien Grand created LUCENE-4161:
------------------------------------

             Summary: Make PackedInts usable by codecs
                 Key: LUCENE-4161
                 URL: https://issues.apache.org/jira/browse/LUCENE-4161
             Project: Lucene - Java
          Issue Type: Improvement
          Components: core/store
            Reporter: Adrien Grand
            Assignee: Adrien Grand
            Priority: Minor


Some codecs might be interested in using PackedInts.{Writer,Reader,ReaderIterator} to read
and write fixed-size values efficiently.

The problem is that the serialization format is self contained, and always writes the name
of the codec, its version, its number of bits per value and its format. For example, if you
want to use packed ints to store your postings list, this is a lot of overhead (at least ~60
bytes per term, in case you only use one Writer per term, more otherwise).

Users should be able to externalize the storage of metadata to save space. For example, to
use PackedInts to store a postings list, one should be able to store the codec name, its version
and the number of bits per doc in the header of the terms+postings list instead of having
to write it once (or more!) per term.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message