lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Uwe Schindler" <...@thetaphi.de>
Subject RE: Lucene VSM scoring
Date Wed, 10 Jul 2013 06:24:11 GMT
Hi,

TF-IDF is just the default (and fast) scoring scheme. You can modify that (the "Similarity")
as you want (since Lucene 4.0):
http://lucene.apache.org/core/4_3_1/core/org/apache/lucene/search/similarities/package-summary.html

There are already various other ones available, like BM25. You have to set the "Similarity"
on IndexWriter and IndexSearcher to use them.

Uwe

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: uwe@thetaphi.de


> -----Original Message-----
> From: Jason Z. [mailto:jasonzhang56@gmail.com]
> Sent: Wednesday, July 10, 2013 8:02 AM
> To: java-user@lucene.apache.org
> Subject: Lucene VSM scoring
> 
> Hi,
> 
> In the Lucene docs it mentions that Lucene impements a tf-idf weighting
> scheme for scoring. Is there anyway to modfiy Lucene to implement a
> custom weighting scheme for the VSM?
> 
> Thank you.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message