lucene-dev mailing list archives

From bugzi...@apache.org
Subject DO NOT REPLY [Bug 31149] - [PATCH] to store binary fields with compression
Date Wed, 29 Sep 2004 18:36:47 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=31149>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=31149

[PATCH] to store binary fields with compression

------- Additional Comments From bernhard.messer@intrafind.de  2004-09-29 18:36 -------
hi,

The zip file added today contains an improved version of the compression patch,
based on the latest source from CVS, with the new features discussed on the
mailing list implemented. The patch contains three diff files (Field.diff,
FieldsReader.diff and FieldsWriter.diff) and one new test case for the
compression functionality.

This patch now allows compression of either binary or string-valued fields.

There is also a small cleanup in FieldsReader and FieldsWriter: static members
now name the bit values, which makes the code more readable (Doug asked for
it).
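
A minimal sketch of what named bit values for a per-field status byte can look
like. The class name, constant names and values here are assumptions made for
illustration and are not taken from the diff files:

```java
// Hypothetical illustration; names and bit values are assumptions,
// not copied from FieldsReader.diff or FieldsWriter.diff.
public class FieldFlagsSketch {

    // One bit per property, so several properties fit in one status byte.
    static final byte FIELD_IS_TOKENIZED = 0x1;
    static final byte FIELD_IS_BINARY = 0x2;
    static final byte FIELD_IS_COMPRESSED = 0x4;

    // Writer side: combine the properties into a single byte.
    public static byte encode(boolean tokenized, boolean binary, boolean compressed) {
        byte bits = 0;
        if (tokenized) bits |= FIELD_IS_TOKENIZED;
        if (binary) bits |= FIELD_IS_BINARY;
        if (compressed) bits |= FIELD_IS_COMPRESSED;
        return bits;
    }

    // Reader side: test individual bits by name instead of by magic number.
    public static boolean isTokenized(byte bits) {
        return (bits & FIELD_IS_TOKENIZED) != 0;
    }

    public static boolean isBinary(byte bits) {
        return (bits & FIELD_IS_BINARY) != 0;
    }

    public static boolean isCompressed(byte bits) {
        return (bits & FIELD_IS_COMPRESSED) != 0;
    }
}
```

With these assumed values, a tokenized and compressed string field would be
written as encode(true, false, true), which is 0x5.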

The Field class now stores all 3 possible data values (reader, string or byte[])
in a single member. Doug also asked for this change; it makes things easier to
handle, but is not directly related to compression.
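
The single-member idea can be sketched roughly as follows. This is a
stand-alone illustration, not the patched Field class; the class name
FieldValueSketch and the member name fieldsData are assumptions:

```java
import java.io.Reader;

// Hypothetical illustration of holding one of three value types in one member.
// This is not the patched Field class; all names here are assumptions.
public class FieldValueSketch {

    // Exactly one of String, Reader or byte[] is stored here.
    private final Object fieldsData;

    public FieldValueSketch(String value) {
        this.fieldsData = value;
    }

    public FieldValueSketch(Reader value) {
        this.fieldsData = value;
    }

    public FieldValueSketch(byte[] value) {
        this.fieldsData = value;
    }

    // Each accessor returns the value only if it has the matching type.
    public String stringValue() {
        return fieldsData instanceof String ? (String) fieldsData : null;
    }

    public Reader readerValue() {
        return fieldsData instanceof Reader ? (Reader) fieldsData : null;
    }

    public byte[] binaryValue() {
        return fieldsData instanceof byte[] ? (byte[]) fieldsData : null;
    }

    public boolean isBinary() {
        return fieldsData instanceof byte[];
    }
}
```

Compared with three separate members, this leaves no way for two of the values
to be set at the same time.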

With compression activated, the index size can be reduced to 60% of the original
size when storing whole documents within the index. Indexing time increases by
roughly 50-70%. Regarding query performance, I saw no difference between a
compressed and an uncompressed index. What may take a bit longer is fetching the
hit documents.
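
To illustrate the trade-off between size and time, here is a minimal sketch of
compressing and restoring a stored value with the JDK's java.util.zip classes.
Whether the patch uses exactly these classes or this compression level is an
assumption; the class name CompressionSketch is hypothetical:

```java
import java.io.ByteArrayOutputStream;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Hypothetical illustration; the patch may use other classes or settings.
public class CompressionSketch {

    // Compress a stored value; this extra work is paid at indexing time.
    public static byte[] compress(byte[] input) {
        Deflater deflater = new Deflater(Deflater.BEST_COMPRESSION);
        try {
            deflater.setInput(input);
            deflater.finish();
            ByteArrayOutputStream out = new ByteArrayOutputStream(input.length);
            byte[] buf = new byte[1024];
            while (!deflater.finished()) {
                int n = deflater.deflate(buf);
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } finally {
            deflater.end(); // release native resources
        }
    }

    // Restore a stored value; this extra work is paid when a hit document is fetched.
    public static byte[] uncompress(byte[] input) {
        Inflater inflater = new Inflater();
        try {
            inflater.setInput(input);
            ByteArrayOutputStream out = new ByteArrayOutputStream(input.length);
            byte[] buf = new byte[1024];
            while (!inflater.finished()) {
                int n = inflater.inflate(buf);
                if (n == 0 && (inflater.needsInput() || inflater.needsDictionary())) {
                    throw new IllegalArgumentException("truncated or invalid compressed data");
                }
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } catch (DataFormatException e) {
            throw new IllegalArgumentException("invalid compressed data", e);
        } finally {
            inflater.end(); // release native resources
        }
    }
}
```

Searching does not touch the stored values, so in a scheme like this the cost
of uncompress is only paid for documents that are actually loaded.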

All Lucene test cases pass. So maybe the Lucene committers can have a look at it
and decide whether it will be part of the next version.

If there are any questions regarding the changes, leave a note on the developer
list.

regards, and have fun with it
bernhard

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org

