lucene-dev mailing list archives

From "Robert Muir (Commented) (JIRA)" <>
Subject [jira] [Commented] (LUCENE-3878) CheckIndex should check deleted documents too
Date Fri, 16 Mar 2012 15:41:37 GMT


Robert Muir commented on LUCENE-3878:

Actually, if we are willing to add SegmentReader.rawTermPositions() to match SegmentReader.rawTermDocs(), we could do this in 3.x too...

> CheckIndex should check deleted documents too
> ---------------------------------------------
>                 Key: LUCENE-3878
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Task
>    Affects Versions: 4.0
>            Reporter: Robert Muir
>             Fix For: 4.0
> In 4.0 livedocs are passed down to the enums, so deleted docs are not so special.
> So I think CheckIndex should not pass the livedocs down to the enums when checking;
> it should pass livedocs=null and check all the postings. It already does this separately
> to collect stats, I think, to compare against the term/collection statistics? But we should
> just clean this up and only use one enum.
> For example, LUCENE-3876 is a case where we were actually making a corrupt index
> (a position was negative), but because the document in question was deleted, CheckIndex
> didn't detect this.
> This could have caused problems if someone just passed null for livedocs (maybe they
> are doing something where it's not so important to take deletions into account).
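
The point of the description above can be illustrated with a small toy model. This is only a sketch and does not use Lucene: PostingsCheckSketch, Posting, and findCorruptDocs are hypothetical names invented for illustration, and a java.util.BitSet stands in for the livedocs. It shows how a check that filters by livedocs misses a negative position in a deleted document, while the same check with livedocs=null finds it.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

/** Toy model of a postings check. All names are hypothetical, not Lucene API. */
class PostingsCheckSketch {

    /** One posting: a document ID and the term positions recorded for it. */
    static final class Posting {
        final int docId;
        final int[] positions;

        Posting(int docId, int... positions) {
            this.docId = docId;
            this.positions = positions;
        }
    }

    /**
     * Returns the IDs of documents that contain a negative position.
     * If liveDocs is null, every posting is checked. Otherwise postings for
     * documents whose bit is clear (deleted) are skipped.
     */
    static List<Integer> findCorruptDocs(List<Posting> postings, BitSet liveDocs) {
        List<Integer> corrupt = new ArrayList<>();
        for (Posting p : postings) {
            if (liveDocs != null && !liveDocs.get(p.docId)) {
                continue; // deleted doc: invisible to a livedocs-filtered check
            }
            for (int pos : p.positions) {
                if (pos < 0) {
                    corrupt.add(p.docId);
                    break; // one bad position is enough to flag the doc
                }
            }
        }
        return corrupt;
    }

    public static void main(String[] args) {
        List<Posting> postings = new ArrayList<>();
        postings.add(new Posting(0, 0, 3, 7));
        postings.add(new Posting(1, -1, 4)); // corrupt: negative position
        postings.add(new Posting(2, 2));

        BitSet liveDocs = new BitSet(3);
        liveDocs.set(0);
        liveDocs.set(2); // doc 1 is deleted

        System.out.println("with livedocs: " + findCorruptDocs(postings, liveDocs));
        System.out.println("livedocs=null: " + findCorruptDocs(postings, null));
    }
}
```

In this model the filtered check reports nothing and the unfiltered check flags document 1, which mirrors the situation described for LUCENE-3876.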
