Hello Luceners,
I have a collection of term (token) vectors that I extracted from files.
I am looking for a way to calculate the TF/IDF weight of each term.
I wanted to use Lucene for this, but Lucene is made for collections of
files, and in my case I have already converted those files into vectors of
terms. I know this measure is not very difficult to implement, but I
guess there should already be an API for it. Does anyone know of a Java
API that handles this directly, or do I have to implement it from scratch?
Any idea would be highly appreciated.
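For concreteness, here is a minimal sketch of what I would otherwise write myself. It assumes each file is already a List<String> of terms, and it uses one common variant of the weighting: tf = (count of term in the vector) / (vector length) and idf = ln(N / df), where N is the number of vectors and df is the number of vectors containing the term. The class and method names (TfIdfSketch, tfIdf) are only illustrative and do not come from Lucene or any other library.

```java
import java.util.*;

class TfIdfSketch {
    // For each term vector, returns a map term -> tf * idf.
    // Assumed weighting: tf = count / vector length, idf = ln(N / df).
    static List<Map<String, Double>> tfIdf(List<List<String>> docs) {
        // Document frequency: in how many vectors each term occurs.
        Map<String, Integer> df = new HashMap<>();
        for (List<String> doc : docs) {
            // A set, so each term is counted once per vector.
            for (String term : new HashSet<>(doc)) {
                df.merge(term, 1, Integer::sum);
            }
        }

        int n = docs.size();
        List<Map<String, Double>> result = new ArrayList<>();
        for (List<String> doc : docs) {
            // Raw term counts within this vector.
            Map<String, Integer> counts = new HashMap<>();
            for (String term : doc) {
                counts.merge(term, 1, Integer::sum);
            }

            Map<String, Double> weights = new HashMap<>();
            for (Map.Entry<String, Integer> e : counts.entrySet()) {
                double tf = (double) e.getValue() / doc.size();
                double idf = Math.log((double) n / df.get(e.getKey()));
                weights.put(e.getKey(), tf * idf);
            }
            result.add(weights);
        }
        return result;
    }
}
```

With this variant, a term that occurs in every vector gets a weight of 0, since ln(N / N) = 0. Other variants (smoothed idf, log-scaled tf) would change only the two lines that compute tf and idf.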
Thank you in advance.
Best regards,
Sengly