lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Erik van Zijst (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-1474) Incorrect SegmentInfo.delCount when IndexReader.flush() is used
Date Thu, 21 May 2009 23:47:45 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-1474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12711868#action_12711868
] 

Erik van Zijst commented on LUCENE-1474:
----------------------------------------

I have attached the output of CheckIndex on all our index directories, which seems to report
quite a few errors:

erik:cache ervzijst$ grep "CorruptIndexException\|AssertionError" CheckIndex.txt 
java.lang.AssertionError: delete count mismatch: info=1263 vs BitVector=1262
java.lang.AssertionError: delete count mismatch: info=496 vs BitVector=493
java.lang.AssertionError: delete count mismatch: info=101 vs BitVector=100
java.lang.AssertionError: delete count mismatch: info=300 vs BitVector=298
java.lang.AssertionError: delete count mismatch: info=109 vs BitVector=108
java.lang.AssertionError: delete count mismatch: info=140 vs BitVector=139
java.lang.AssertionError: delete count mismatch: info=122 vs BitVector=121
java.lang.AssertionError: delete count mismatch: info=91 vs BitVector=89
java.lang.AssertionError: delete count mismatch: info=1411 vs BitVector=1409
java.lang.AssertionError: delete count mismatch: info=801 vs BitVector=800
java.lang.AssertionError: delete count mismatch: info=630 vs BitVector=629
java.lang.AssertionError: delete count mismatch: info=510 vs BitVector=508
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _0: fieldsReader
shows 12365 but segmentInfo shows 12232
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _1: fieldsReader
shows 10144 but segmentInfo shows 8766
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _2: fieldsReader
shows 4616 but segmentInfo shows 7006
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _3: fieldsReader
shows 6681 but segmentInfo shows 4854
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _4: fieldsReader
shows 2652 but segmentInfo shows 8808
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _5: fieldsReader
shows 11500 but segmentInfo shows 14551
org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _6: fieldsReader
shows 16225 but segmentInfo shows 4375
erik:cache ervzijst$


> Incorrect SegmentInfo.delCount when IndexReader.flush() is used
> ---------------------------------------------------------------
>
>                 Key: LUCENE-1474
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1474
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Index
>    Affects Versions: 2.4
>            Reporter: Marcel Reutegger
>            Assignee: Michael McCandless
>             Fix For: 2.4.1, 2.9
>
>         Attachments: CheckIndex.txt, IndexReaderTest.java
>
>
> When deleted documents are flushed using IndexReader.flush() the delCount in SegmentInfo
is updated based on the current value and SegmentReader.pendingDeleteCount (introduced by
LUCENE-1267). It seems that pendingDeleteCount is not reset after the commit, which means
after a second flush() or close() of an index reader the delCount in SegmentInfo is incorrect.
A subsequent IndexReader.open() call will fail with an error when assertions are enabled.
E.g.:
> java.lang.AssertionError: delete count mismatch: info=3 vs BitVector=2
> 	at org.apache.lucene.index.SegmentReader.loadDeletedDocs(SegmentReader.java:405)
> [...]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message