mahout-user mailing list archives

From vineet yadav <>
Subject Re: clustering using n-grams
Date Thu, 17 Mar 2011 06:51:25 GMT
Hi Shambhu,
Check out Grant's article on Lucid Imagination.
Vineet Yadav
On Thu, Mar 17, 2011 at 11:39 AM, shambhusingh <> wrote:
> I have created a Lucene index for database content and have added the fields
> documentId and content using i want to use mahout lucene.vector
> to create the sequence file using the n-grams algorithm, and then I will do the
> Mahout clustering on top of that.
> How do I use n-grams instead of TF-IDF for generating Lucene vectors?
> Please help.
> Or how do I create clusters using n-grams instead of TF-IDF with a Lucene index?
