lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Halácsy Péter <halacsy.pe...@axelero.com>
Subject Re: Relevance boosting with the aid of semantic markup
Date Mon, 10 Dec 2001 08:09:13 GMT
Doug Cutting wrote:

>
>>Why can't we store some value of each word. If I could index 
>>the stems 
>>of the words as well, I gave lower value to them.
>>I know a Russion search engine that uses 3 (or 4 I don't remember) 
>>distinct value to classify each term in the index:
>>1. original word
>>2. stem
>>3. spam
>>
>>The priority of the terms is calculated at indexing time and used for 
>>ranking.
>>
>
>Would such weighting be per word, or per word occurence?  Earlier you were
>asking for the ability to separately weight word occurences, e.g. to boost
>them if they are emphasized in the text.  That was what I was responding to.
>
per word occurence (don't forget it's only interesting if I can put more 
than 1 words to the same position)


>
>Doug
>
peter


--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message