lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Christoph Kiefer <>
Subject TFIDF Implementation
Date Tue, 14 Dec 2004 16:44:47 GMT
My current task/problem is the following: I need to implement TFIDF
document term ranking using Jakarta Lucene to compute a similarity rank
between arbitrary documents in the constructed index.
I saw from the API that there are similar functions already implemented
in the class Similarity and DefaultSimilarity but I don't know exactly
how to use them. At the time my index has about 25000 (small) documents
and there are about 75000 terms stored in total.
Now, my question is simple. Does anybody has done this before or could
point me to another location for help?

Thanks for any help in advance.

Christoph Kiefer

Department of Informatics, University of Zurich

Office: Uni Irchel 27-K-32
Phone:  +41 (0) 44 / 635 67 26

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message