lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "robert engels (JIRA)" <>
Subject [jira] Commented: (LUCENE-1043) Speedup merging of stored fields when field mapping "matches"
Date Fri, 02 Nov 2007 20:39:51 GMT


robert engels commented on LUCENE-1043:

When bulk copying the documents, I think you need to:

read array of long from index (8 * (ndocs+1)) in long[ndocs+1] offsets;
calculate length = offset[ndocs]-offset[0];
read bytes of length from document file
startoffset = current output document stream position
write bytes to output document
modify offset[] adding startoffset-offset[0] to each entry
write offset[] in bulk to index output

> Speedup merging of stored fields when field mapping "matches"
> -------------------------------------------------------------
>                 Key: LUCENE-1043
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: 2.2
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>            Priority: Minor
>             Fix For: 2.3
>         Attachments: LUCENE-1043.patch
> Robert Engels suggested the following idea, here:
> When merging in the stored fields from a segment, if the field name ->
> number mapping is identical then we can simply bulk copy the entire
> entry for the document rather than re-interpreting and then re-writing
> the actual stored fields.
> I've pulled the code from the above thread and got it working on the
> current trunk.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message