mahout-user mailing list archives

From Grant Ingersoll <gsing...@apache.org>
Subject Re: Help regarding Apache Mahout.
Date Wed, 04 Jan 2012 16:31:18 GMT
Hi Junaid,

Have a look at the SparseVectorsFromSequenceFiles class, which already computes TF-IDF
vectors. Use it in combination with SequenceFilesFromDirectory, which converts a directory
of text files to SequenceFiles.
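
For orientation, here is a minimal, self-contained sketch of the weighting itself in plain
Java. It does not call Mahout at all: the class TfIdfSketch and its methods are hypothetical
names made up for illustration, the tokenizer is a crude stand-in for a real analyzer, and the
formula is the textbook tf * ln(N / df), which may differ from what the Mahout classes
actually apply. It only shows what a TF-IDF vector per document looks like.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration class; not part of Mahout.
public class TfIdfSketch {

    // Lowercase the text and split on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // For each document, returns a map: term -> tf * ln(N / df).
    static List<Map<String, Double>> tfIdf(List<String> docs) {
        List<Map<String, Integer>> termCounts = new ArrayList<>();
        Map<String, Integer> docFreq = new HashMap<>();
        for (String doc : docs) {
            Map<String, Integer> counts = new HashMap<>();
            for (String term : tokenize(doc)) {
                counts.merge(term, 1, Integer::sum);   // term frequency in this document
            }
            for (String term : counts.keySet()) {
                docFreq.merge(term, 1, Integer::sum);  // number of documents containing the term
            }
            termCounts.add(counts);
        }
        int n = docs.size();
        List<Map<String, Double>> result = new ArrayList<>();
        for (Map<String, Integer> counts : termCounts) {
            Map<String, Double> weights = new HashMap<>();
            for (Map.Entry<String, Integer> e : counts.entrySet()) {
                double idf = Math.log((double) n / docFreq.get(e.getKey()));
                weights.put(e.getKey(), e.getValue() * idf);
            }
            result.add(weights);
        }
        return result;
    }

    // Reads every regular file directly inside the given directory as one document.
    static List<String> readDirectory(Path dir) throws IOException {
        List<String> docs = new ArrayList<>();
        try (Stream<Path> files = Files.list(dir)) {
            for (Path p : files.filter(Files::isRegularFile).sorted().collect(Collectors.toList())) {
                docs.add(Files.readString(p));
            }
        }
        return docs;
    }

    public static void main(String[] args) throws IOException {
        // With a directory argument, use its files; otherwise fall back to two sample strings.
        List<String> docs = args.length > 0
                ? readDirectory(Paths.get(args[0]))
                : List.of("apache mahout builds vectors", "mahout reads sequence files");
        List<Map<String, Double>> weights = tfIdf(docs);
        for (int i = 0; i < weights.size(); i++) {
            System.out.println("doc " + i + ": " + weights.get(i));
        }
    }
}
```

A term that occurs in every document gets weight 0 under this formula, since ln(N / N) = 0.
The sketch keeps everything in memory, so it is only suitable for a small prototype.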

-Grant
On Jan 4, 2012, at 8:30 AM, Junaid Surve wrote:

> Hi
> 
> I want to develop a Prototype to calculate the TF IDF from the documents
> present in a directory.
> 
> Can you please help me with the Steps to go about it using Apache Mahout?
> Thank you.
> 
> -- 
> Regards
> Junaid

--------------------------------------------
Grant Ingersoll
http://www.lucidimagination.com
