lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Leo Galambos <galam...@com-os2.ms.mff.cuni.cz>
Subject Re: Automatic stop-words
Date Thu, 23 Jan 2003 18:56:38 GMT
> >>>When I want to search "Linux", nothing is found.
> >>>This word is in every article in the content.
> >>>Or is something wrong?
> >>Yes :)
> > why? log(1)=0. it is OK, I think :-))) so where's any problem?
> Thus a term which occurs in every document gets a value of 1.0, not zero.

I then believe in UFO :-) So does he have long documents? Can it then fall 
to 0, when you normalize the vector (tf_linux=1 |w|=20000words)?

-g-



--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message