lucene-java-user mailing list archives

From Jochen Wersdörfer
Subject term frequency normalization
Date Tue, 03 Feb 2009 13:26:28 GMT

I'd like to use the term frequency normalization described in

so that the term frequency tf becomes

tf(t, d) = log(1 + freq(t, d)) / log(1 + avgFreq(d))

The easiest way to change the tf calculation would be to override
tf in a custom implementation of Similarity, as is done in
SweetSpotSimilarity. But the average term frequency of the
document is not available there. Is there a simple way to get or
calculate this?
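
To make the quantity concrete, here is a minimal sketch of the calculation in plain Java, outside of Lucene. It assumes that avgFreq(d) means the total number of term occurrences in d divided by the number of distinct terms in d; that definition is an assumption, not something Lucene provides. The class and method names (AvgTfSketch, termFreqs, avgFreq, normalizedTf) are hypothetical and only for illustration.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: not part of Lucene and not a Similarity subclass.
final class AvgTfSketch {

    // Count how often each term occurs in the token list of one document.
    static Map<String, Integer> termFreqs(List<String> tokens) {
        Map<String, Integer> freqs = new HashMap<>();
        for (String token : tokens) {
            freqs.merge(token, 1, Integer::sum);
        }
        return freqs;
    }

    // ASSUMPTION: avgFreq(d) = total term occurrences / number of distinct terms.
    // Returns 0 for an empty document.
    static double avgFreq(Map<String, Integer> freqs) {
        if (freqs.isEmpty()) {
            return 0.0;
        }
        long total = 0;
        for (int f : freqs.values()) {
            total += f;
        }
        return (double) total / freqs.size();
    }

    // tf(t, d) = log(1 + freq(t, d)) / log(1 + avgFreq(d)).
    // Returns 0 when avgFreq is not positive, to avoid dividing by log(1) = 0.
    static double normalizedTf(int freq, double avgFreq) {
        if (avgFreq <= 0.0) {
            return 0.0;
        }
        return Math.log(1 + freq) / Math.log(1 + avgFreq);
    }
}
```

With the tokens a, b, a, c, a, b the counts are a=3, b=2, c=1, so avgFreq is 6 / 3 = 2, and the normalized tf for "a" is log(4) / log(3). The open question remains how to make such a per-document value available where Similarity computes tf.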

