lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Koji Sekiguchi <k...@r.email.ne.jp>
Subject Re: Measuring document similarity
Date Tue, 13 Mar 2012 02:24:47 GMT
(12/03/13 2:38), Hassane Cabir wrote:
> Hi guys,
>
> I'm using Lucene for my project and I need to calcule how similar two (or
> more) documents are, using TFIDF. How to get TFIDF with lucene?
>
> Any insights on this?

Solr has TermVectorComponent which can return tf, df and tf-idf of each term
in a document. To use it, the document should be TermVector enabled.

koji
-- 
Query Log Visualizer for Apache Solr
http://soleami.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message