lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: termFreq always = 1 ?
Date Thu, 02 Oct 2008 23:26:55 GMT

: Yes this may be my problem,
: 
: But is there any solution to have only one "men" keyword indexed when i''ve
: got something like this :

SOLR-739 is working towards a new omitTf option for fields (taking 
advantage of a Lucene optimization for this case) but in the mean time the 
best options i can think of are

1) a custom TokenFilter that keeps track of every token it's ever seen and 
removes *all* dups

2) a custom Similarity with a tf() func that returns a constant value 
regardless of the input.  (the termFreq stored in the index will be the 
same, but the scores will be equivilent)




-Hoss


Mime
View raw message