lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 32921] New: - Some utilities for a compact sparse filter
Date Mon, 03 Jan 2005 15:38:27 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG·
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=32921>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND·
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=32921

           Summary: Some utilities for a compact sparse filter
           Product: Lucene
           Version: CVS Nightly - Specify date in submission
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: enhancement
          Priority: P1
         Component: Search
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: paul.elschot@xs4all.nl


Two files are attached that might form the basis for an alternative 
filter implementation that is more memory efficient than one bit 
per doc when less than about 1/8 of the docs pass through the filter. 
 
The document numbers are stored in RAM as VInt's from the Lucene index 
format. These VInt's encode the difference between two successive 
document numbers, much like a PositionDelta in the Positions: 
http://jakarta.apache.org/lucene/docs/fileformats.html 
 
The getByteSize() method can be used to verify the compression 
once a SortedVIntList is constructed. 
The precise conditions under which this is more memory efficient than 
one bit per document are not easy to specify in advance.

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message