lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lukas Zapletal <l...@root.cz>
Subject Re: Automatic stop-words
Date Thu, 23 Jan 2003 19:26:11 GMT
>
>
>>>>>When I want to search "Linux", nothing is found.
>>>>>This word is in every article in the content.
>>>>>Or is something wrong?
>>>>>          
>>>>>
>>>>Yes :)
>>>>        
>>>>
>>>why? log(1)=0. it is OK, I think :-))) so where's any problem?
>>>      
>>>
>>Thus a term which occurs in every document gets a value of 1.0, not zero.
>>    
>>
>
>I then believe in UFO :-) So does he have long documents? Can it then fall 
>to 0, when you normalize the vector (tf_linux=1 |w|=20000words)?
>  
>
Well I solved it. The bug was in my head. I forgot this is an old index 
that was created with old stop-words.
Hard do solve this, yes it was ;-) heh

Happy new year!

-- 
Lukas Zapletal      [lzap@root.cz]
http://www.tanecni-olomouc.cz/lzap




--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message