lucene-dev mailing list archives

From "Michael McCandless (Resolved) (JIRA)" <>
Subject [jira] [Resolved] (LUCENE-2445) Perf improvements for the DocsEnum bulk read API
Date Tue, 20 Mar 2012 15:33:39 GMT


Michael McCandless resolved LUCENE-2445.

    Resolution: Won't Fix

We removed the bulk API in 4.0.
> Perf improvements for the DocsEnum bulk read API
> ------------------------------------------------
>                 Key: LUCENE-2445
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: core/index
>            Reporter: Michael McCandless
>             Fix For: 4.0
> I started to work on LUCENE-2443, to create a test showing the
> problems, but it turns out none of the core codecs (even sep/intblock)
> ever set a non-zero offset.
> So I set out to fix sep to do so, but ran into some issues with the
> current bulk-read API that we should fix to improve its
> performance:
>   * Filtering of deleted docs should be the caller's job (saves an
>     extra pass through the docs)
>   * Probably docs should arrive as deltas and caller sums these up to
>     get the actual docID
>   * Whether to load freqs or not should be separately controllable
>   * We may want to require that the int[] for docs and freqs are
>     "aligned", i.e., the offset into each is the same
>   * Maybe we should separate out a BulkDocsEnum from DocsEnum.  We can
>     make it optional for codecs (i.e., we can emulate BulkDocsEnum from
>     the DocsEnum)
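
The proposals in the list above can be illustrated with a small, self-contained sketch. It does not use Lucene's actual classes: `SimpleDocsEnum`, `EmulatedBulkDocsEnum`, `fromArrays`, and `collectLiveDocs` are hypothetical names invented for this example, and a `BitSet` of deleted docs is an assumption standing in for whatever deletion structure the caller has. The sketch shows a bulk reader emulated on top of a one-doc-at-a-time enum, docs delivered as deltas in buffers aligned with freqs, optional freq loading, and deleted-doc filtering left to the caller.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

final class BulkReadSketch {

    /** Hypothetical one-doc-at-a-time enum; NOT Lucene's real DocsEnum. */
    interface SimpleDocsEnum {
        int NO_MORE_DOCS = Integer.MAX_VALUE;
        int nextDoc(); // next docID in increasing order, or NO_MORE_DOCS
        int freq();    // term frequency of the current doc
    }

    /** Hypothetical bulk reader emulated from a SimpleDocsEnum. */
    static final class EmulatedBulkDocsEnum {
        private final SimpleDocsEnum in;
        private final boolean loadFreqs;
        private int lastDoc = 0;
        final int[] docDeltas; // entry i is docID minus the previous docID
        final int[] freqs;     // aligned with docDeltas; null if freqs are not loaded

        EmulatedBulkDocsEnum(SimpleDocsEnum in, int bufferSize, boolean loadFreqs) {
            this.in = in;
            this.loadFreqs = loadFreqs;
            this.docDeltas = new int[bufferSize];
            this.freqs = loadFreqs ? new int[bufferSize] : null;
        }

        /** Fills the buffers from index 0; returns the count, 0 when exhausted. */
        int read() {
            int count = 0;
            while (count < docDeltas.length) {
                int doc = in.nextDoc();
                if (doc == SimpleDocsEnum.NO_MORE_DOCS) {
                    break;
                }
                docDeltas[count] = doc - lastDoc; // deleted docs are NOT filtered here
                if (loadFreqs) {
                    freqs[count] = in.freq();
                }
                lastDoc = doc;
                count++;
            }
            return count;
        }
    }

    /** Array-backed enum used only to drive the example. */
    static SimpleDocsEnum fromArrays(final int[] docs, final int[] freqs) {
        return new SimpleDocsEnum() {
            private int pos = -1;
            public int nextDoc() {
                if (pos + 1 >= docs.length) {
                    return NO_MORE_DOCS;
                }
                return docs[++pos];
            }
            public int freq() {
                return freqs[pos];
            }
        };
    }

    /** Caller side: sums deltas back into docIDs and skips deleted docs itself. */
    static List<Integer> collectLiveDocs(SimpleDocsEnum in, BitSet deleted, int bufferSize) {
        EmulatedBulkDocsEnum bulk = new EmulatedBulkDocsEnum(in, bufferSize, false);
        List<Integer> live = new ArrayList<>();
        int doc = 0;
        int n;
        while ((n = bulk.read()) > 0) {
            for (int i = 0; i < n; i++) {
                doc += bulk.docDeltas[i];
                if (!deleted.get(doc)) {
                    live.add(doc);
                }
            }
        }
        return live;
    }

    public static void main(String[] args) {
        BitSet deleted = new BitSet();
        deleted.set(5);
        SimpleDocsEnum docs = fromArrays(new int[] {2, 5, 9, 14}, new int[] {1, 1, 2, 1});
        System.out.println(collectLiveDocs(docs, deleted, 3)); // prints [2, 9, 14]
    }
}
```

Because the reader always fills both buffers from index 0, the "aligned offsets" requirement holds trivially in this sketch. A real codec would more likely decode blocks directly into the buffers rather than loop over `nextDoc()`.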

This message is automatically generated by JIRA.
