lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: Lucene Scoring
Date Wed, 08 Mar 2006 20:03:32 GMT

: Roughly speaking:
:
: * Documents containing *all* the search terms are good
: * Matches on rare words are better than for common words
: * Long documents are not as good as short ones
: * Documents which mention the search terms many times are good

Be wary of the distinction between "term" and "word" and how that affects
statements like "Long documents are not as good as short ones" ... If you
have a title field and body field and one document has a really long body,
but a very short title then a search on the title isn't going to be
penalized by the length of the body ... you have to choose your words
carefully.






-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message