lucene-dev mailing list archives

From Doug Cutting <cutt...@lucene.com>
Subject Re: Automatic stop-words
Date Wed, 22 Jan 2003 21:29:38 GMT
Leo Galambos wrote:
>>>When I want to search "Linux", nothing is found.
>>>This word is in every article in the content.
>>>Or is something wrong?
>>
>>Yes :)
> 
> 
> why? log(1)=0. it is OK, I think :-))) so where's any problem?

Lucene's IDF computation is:

    log(maxDoc / (docFreq + 1)) + 1.0

Thus a term which occurs in every document gets a value of about 1.0 (just under, because of the +1 in the denominator), not zero.
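
A quick numeric check of that claim, as a minimal sketch in Python rather than Lucene's own Java. It reads the denominator as docFreq + 1; the function name idf and the index size of 1000 documents are illustrative assumptions, not part of Lucene's API.

```python
import math


def idf(doc_freq: int, max_doc: int) -> float:
    """IDF as given in this message: log(maxDoc / (docFreq + 1)) + 1.0."""
    # Natural logarithm; the +1 in the denominator avoids division by zero
    # for a term that appears in no documents.
    return math.log(max_doc / (doc_freq + 1)) + 1.0


if __name__ == "__main__":
    max_doc = 1000  # hypothetical index size
    for doc_freq in (1, 10, 1000):
        # doc_freq == max_doc models a term such as "Linux" that occurs
        # in every document of the collection.
        print(doc_freq, idf(doc_freq, max_doc))
```

With doc_freq equal to max_doc the result stays close to 1.0 instead of dropping to zero, so by this formula alone a term present in every document still contributes to the score.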

Doug



