lucene-dev mailing list archives

From "Shahan Khatchadourian (JIRA)" <>
Subject [jira] Updated: (LUCENE-713) File Formats Documentation is not correct for Term Vectors
Date Sat, 14 Jul 2007 21:51:04 GMT


Shahan Khatchadourian updated LUCENE-713:

In the current implementation, token offsets are not stored exactly as they are provided in the Token.
DocumentWriter stores a token's startOffset relative to the previous Token's endOffset plus one.
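
One possible reading of this behavior is sketched below. It is an illustration only, not DocumentWriter's actual code: the class and method names are hypothetical, and it assumes the shift applies when several values are indexed one after another, with each later value's offsets moved by a base equal to the previous value's last endOffset plus one.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Lucene code: shows how stored offsets can differ
// from the offsets carried by each Token.
public class OffsetShiftSketch {

    // values[v][t] = {startOffset, endOffset} as reported by the token itself.
    // Returns the {start, end} pairs as they would be stored under the
    // assumption described above.
    static int[][] storedOffsets(int[][][] values) {
        List<int[]> stored = new ArrayList<>();
        int base = 0; // running shift applied to every token of the current value
        for (int[][] tokens : values) {
            int[] last = null;
            for (int[] tok : tokens) {
                stored.add(new int[] { base + tok[0], base + tok[1] });
                last = tok;
            }
            // Assumption: the next value starts one past the previous
            // value's last endOffset.
            if (last != null) {
                base += last[1] + 1;
            }
        }
        return stored.toArray(new int[0][]);
    }

    public static void main(String[] args) {
        int[][][] values = {
            { { 0, 5 }, { 6, 11 } }, // tokens of the first value
            { { 0, 3 } }             // token of the second value
        };
        for (int[] pair : storedOffsets(values)) {
            System.out.println(pair[0] + "-" + pair[1]);
        }
    }
}
```

Under this reading, the second value's token reports offsets 0-3 but is stored as 12-15, which is why the stored offsets cannot be read back as the original Token offsets.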

> File Formats Documentation is not correct for Term Vectors
> ----------------------------------------------------------
>                 Key: LUCENE-713
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Website
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
> From Samir Abdou on the dev mailing list:
> Hi, 
> There is an inconsistency between the files format page (from Lucene
> website) and the source code. It concerns the positions and offsets of term
> vectors. It seems that documentation (website) is not up to date. According
> to the file format page, offsets and positions are not stored! Is that
> correct?
> Many thanks,
> Samir
> -----
> Indeed, the term vectors section of the file formats page doesn't talk about the storing of
> position and offset info.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
