lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tim Sturge <>
Subject Re: indexing fields with multiplicity
Date Wed, 29 Aug 2007 17:13:11 GMT
I'm looking for a boost when the anchor text is more commonly associated 
with one topic than another. For example the United States of America
is called "USA" by a lot of people. The United Space Alliance is also 
called "USA" but by many less people.

If I just index them both with "USA" once, they will rank equally. I 
want the United States of America to rank higher.


Karl Wettin wrote:
> 28 aug 2007 kl. 21.41 skrev Tim Sturge:
>> Hi,
>> I have fields which have high multiplicity; for example I have a 
>> topic with 1000 names, 500 of which are "USA" and 200 are "United 
>> States of America".
>> Previously I was indexing "USA USA .(500x).. USA United States of 
>> America .(200x).. United States of America" as as single field. The 
>> problem is that this causes this field to be less weighted for "USA" 
>> than a topic with a single name "USA".
>> So what I am now going to do is call
>> for (i = 0 ; i < 500 ; i++) {
>> document.add(new Field("anchor","USA"));
>> }
> Why do you do this? What is the effect you are looking for?

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message