lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yonik Seeley <>
Subject Re: Deleted files considered for scoring
Date Sun, 10 May 2009 22:38:49 GMT
On Sun, May 10, 2009 at 5:37 PM, Moshe Cohen <> wrote:
> I am using Lucene 2.4.1 via Pylucene and have encountered the following
> behavior:
> When there are deleted documents in the index the search scores are
> identical to those that exist had those documents not been deleted.
> If I optimize the index and the deleted documents are actually removed, the
> the scoring is the same as if those documents were never indexed at all.

This is working as designed... a known design tradeoff / limitation.
When a document is marked as deleted,  document frequency for terms
don't change (changing them would be impractical).


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message