lucene-dev mailing list archives

From "Michael McCandless (JIRA)" <>
Subject [jira] Updated: (LUCENE-2321) use packed ints for the terms dict index
Date Sun, 14 Mar 2010 17:41:27 GMT


Michael McCandless updated LUCENE-2321:

    Attachment: LUCENE-2321.patch

First cut -- it has some limitations though:

  * Cannot handle termsIndexInterval yet (forces this to 1)

  * Loads all term data as a single byte[] -- this may be a problem, e.g. it means the total
bytes for indexed terms cannot exceed 2.1B; the allocation may also fail due to memory
fragmentation on 32-bit JREs.  So I think we have to make this paged.
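To illustrate what "paged" could mean here: a Java array is indexed by int, so a single byte[] tops out at roughly 2.1B entries. Splitting the data across fixed-size pages lifts that limit and avoids one huge contiguous allocation. The following is only a minimal sketch of the idea, not the code in the patch; the class name PagedByteStore and its methods are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: append-only byte storage split into fixed-size pages. */
public class PagedByteStore {
    private final int pageBits;   // log2 of the page size
    private final int pageSize;   // bytes per page, always a power of two
    private final int pageMask;   // pageSize - 1, extracts the offset within a page
    private final List<byte[]> pages = new ArrayList<>();
    private long length = 0;      // total bytes written; a long, so it can pass 2^31

    public PagedByteStore(int pageBits) {
        if (pageBits < 1 || pageBits > 30) {
            throw new IllegalArgumentException("pageBits must be in [1, 30]");
        }
        this.pageBits = pageBits;
        this.pageSize = 1 << pageBits;
        this.pageMask = pageSize - 1;
    }

    /** Appends src and returns the global offset at which it starts. */
    public long append(byte[] src) {
        long start = length;
        int copied = 0;
        while (copied < src.length) {
            int pageIndex = (int) (length >>> pageBits);
            if (pageIndex == pages.size()) {
                pages.add(new byte[pageSize]); // allocate pages lazily, one at a time
            }
            int offsetInPage = (int) (length & pageMask);
            int chunk = Math.min(src.length - copied, pageSize - offsetInPage);
            System.arraycopy(src, copied, pages.get(pageIndex), offsetInPage, chunk);
            copied += chunk;
            length += chunk;
        }
        return start;
    }

    /** Reads the byte at a global (long) position. */
    public byte get(long pos) {
        if (pos < 0 || pos >= length) {
            throw new IndexOutOfBoundsException("pos=" + pos);
        }
        return pages.get((int) (pos >>> pageBits))[(int) (pos & pageMask)];
    }

    public long length() {
        return length;
    }

    public static void main(String[] args) {
        PagedByteStore store = new PagedByteStore(2); // tiny 4-byte pages, for demonstration only
        long first = store.append(new byte[] {1, 2, 3, 4, 5, 6});
        long second = store.append(new byte[] {7, 8, 9});
        System.out.println(first + " " + second + " " + store.length());
    }
}
```

In this sketch a term's bytes may straddle a page boundary; a real implementation would have to choose between allowing that and padding to the next page.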

Also found & fixed some small bugs in packed ints.
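For readers unfamiliar with the term, "packed ints" means storing each value in only as many bits as the largest value requires, instead of a full 32 or 64 bits. The sketch below shows the general technique under simple assumptions (non-negative values, a fixed bit width chosen up front); it is not Lucene's packed ints implementation, and the class name PackedIntArray and its methods are hypothetical.

```java
/** Hypothetical sketch: fixed-width unsigned values packed back to back into a long[]. */
public class PackedIntArray {
    private final long[] blocks;
    private final int bitsPerValue;
    private final int size;
    private final long mask; // low bitsPerValue bits set

    public PackedIntArray(int size, long maxValue) {
        if (size < 0 || maxValue < 0) {
            throw new IllegalArgumentException("size and maxValue must be non-negative");
        }
        this.size = size;
        // Bits needed for maxValue; at least 1, at most 63 because maxValue >= 0.
        this.bitsPerValue = Math.max(1, 64 - Long.numberOfLeadingZeros(maxValue));
        this.mask = (1L << bitsPerValue) - 1;
        long totalBits = (long) size * bitsPerValue;
        this.blocks = new long[(int) ((totalBits + 63) >>> 6)];
    }

    public void set(int index, long value) {
        if (index < 0 || index >= size) throw new IndexOutOfBoundsException("index=" + index);
        if (value < 0 || value > mask) throw new IllegalArgumentException("value out of range");
        long bitPos = (long) index * bitsPerValue;
        int block = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        blocks[block] = (blocks[block] & ~(mask << shift)) | (value << shift);
        int spill = shift + bitsPerValue - 64; // bits that do not fit in this block
        if (spill > 0) {
            int lowBits = bitsPerValue - spill; // bits already stored in this block
            blocks[block + 1] = (blocks[block + 1] & ~(mask >>> lowBits)) | (value >>> lowBits);
        }
    }

    public long get(int index) {
        if (index < 0 || index >= size) throw new IndexOutOfBoundsException("index=" + index);
        long bitPos = (long) index * bitsPerValue;
        int block = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        long value = blocks[block] >>> shift;
        if (shift + bitsPerValue > 64) {
            value |= blocks[block + 1] << (64 - shift); // pull in the spilled high bits
        }
        return value & mask;
    }

    public int bitsPerValue() {
        return bitsPerValue;
    }

    /** Bytes used by the backing array, excluding object overhead. */
    public long ramBytes() {
        return blocks.length * 8L;
    }

    public static void main(String[] args) {
        PackedIntArray packed = new PackedIntArray(100, 1000);
        for (int i = 0; i < 100; i++) {
            packed.set(i, i * 10L);
        }
        System.out.println(packed.bitsPerValue() + " bits/value, " + packed.ramBytes() + " bytes");
    }
}
```

With a maximum value of 1000, each entry needs 10 bits, so 100 entries fit in 16 longs (128 bytes) where a plain int[] would use 400 bytes. The trade-off is extra shifting and masking on every read.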

> use packed ints for the terms dict index
> ----------------------------------------
>                 Key: LUCENE-2321
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: Flex Branch
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: Flex Branch
>         Attachments: LUCENE-2321.patch
> Terms dict index needs to store large RAM-resident arrays of ints, but, because their
size is bounded & variable (depending on the segment/docs), we should use packed ints for

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
