lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <>
Subject [jira] Updated: (LUCENE-1410) PFOR implementation
Date Tue, 06 Oct 2009 16:23:31 GMT


Michael McCandless updated LUCENE-1410:

    Attachment: LUCENE-1410-codecs.tar.bz2

Attaching sep, intblock and pfordelta codecs, spun out of the last patch on LUCENE-1458.

Once LUCENE-1458 is in, we should finish the pfordelta codec to make it a real choice.

I actually think some combination of pulsing, standard, pfordelta and simple bit packing (in
order by increasing term's docFreq), within a single codec, may be best.

Ie, rare terms (only in a doc or two) could be inlined into the the terms dict.  Slightly
more common terms can use the more CPU intensive standard codec.  Common terms can use cpu-friendly-yet-still-decent-compression
pfordelta.  Obsenely common terms can use bit packing for the fastest decode.

> PFOR implementation
> -------------------
>                 Key: LUCENE-1410
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Other
>            Reporter: Paul Elschot
>            Priority: Minor
>         Attachments: autogen.tgz, LUCENE-1410-codecs.tar.bz2, LUCENE-1410b.patch, LUCENE-1410c.patch,
LUCENE-1410d.patch, LUCENE-1410e.patch, TermQueryTests.tgz,,,
>   Original Estimate: 21840h
>  Remaining Estimate: 21840h
> Implementation of Patched Frame of Reference.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message