lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Bedaride <paul.bedar...@xilopix.com>
Subject Token type similarity
Date Wed, 08 Jul 2015 11:47:40 GMT
Hello,

I wonder how token type are taken in account in similarity scoring.

 From my test it appears that lucene do a scoring on the term text
and the term type separately.

For instance, with the documents (with term text/type)
d1: w1/t1 w2/t1 w3/t2
d2: w1/t2 w2/t1 w3/t1

and the search w1/t1, I get the same score for d1 and d2

Is there a way to improve the score of d1 because the same token
hat the right token text and type ?

Thanks

Paul B├ędaride

Mime
View raw message