lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yonik Seeley <yo...@lucidimagination.com>
Subject Re: Deleted files considered for scoring
Date Sun, 10 May 2009 22:38:49 GMT
On Sun, May 10, 2009 at 5:37 PM, Moshe Cohen <moshec@gmail.com> wrote:
> I am using Lucene 2.4.1 via Pylucene and have encountered the following
> behavior:
> When there are deleted documents in the index the search scores are
> identical to those that exist had those documents not been deleted.
> If I optimize the index and the deleted documents are actually removed, the
> the scoring is the same as if those documents were never indexed at all.

This is working as designed... a known design tradeoff / limitation.
When a document is marked as deleted,  document frequency for terms
don't change (changing them would be impractical).

-Yonik
http://www.lucidimagination.com

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message