lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wettin <karl.wet...@gmail.com>
Subject Re: indexing fields with multiplicity
Date Wed, 29 Aug 2007 17:10:35 GMT

29 aug 2007 kl. 19.13 skrev Tim Sturge:

> I'm looking for a boost when the anchor text is more commonly  
> associated with one topic than another. For example the United  
> States of America
> is called "USA" by a lot of people. The United Space Alliance is  
> also called "USA" but by many less people.
>
> If I just index them both with "USA" once, they will rank equally.  
> I want the United States of America to rank higher.

Why not use Field#setBoost(float)?


-- 
karl


>
> Tim
>
> Karl Wettin wrote:
>>
>> 28 aug 2007 kl. 21.41 skrev Tim Sturge:
>>
>>> Hi,
>>>
>>> I have fields which have high multiplicity; for example I have a  
>>> topic with 1000 names, 500 of which are "USA" and 200 are "United  
>>> States of America".
>>>
>>> Previously I was indexing "USA USA .(500x).. USA United States of  
>>> America .(200x).. United States of America" as as single field.  
>>> The problem is that this causes this field to be less weighted  
>>> for "USA" than a topic with a single name "USA".
>>>
>>> So what I am now going to do is call
>>>
>>> for (i = 0 ; i < 500 ; i++) {
>>> document.add(new Field("anchor","USA"));
>>> }
>>
>> Why do you do this? What is the effect you are looking for?
>>
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message