mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Suneel Marthi <>
Subject Re: Mapping from docId to clusters in the clusterdump
Date Sun, 02 Feb 2014 10:11:39 GMT
This is an issue that was very recently fixed (infact fixed last week). Please work off of
present trunk, u should see the name of the text files that r part of clusters.

On Sunday, February 2, 2014 5:09 AM, Sznajder ForMailingList <>

I have a directory containing thousands of text files.
I ran the KMeans cluster algorithm following the tutorial in the Mahout In
Action book.

However, I need to know which text file was mapped to which cluster.

I did not find the easy way to do that. I ran the clusterdump algorithm ,
but I succeed only to get mapping from vector to cluster, and not from
Document to Cluster.

Any help is welcome!

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message