mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sznajder ForMailingList <>
Subject Mapping from docId to clusters in the clusterdump
Date Sun, 02 Feb 2014 10:08:28 GMT

I have a directory containing thousands of text files.
I ran the KMeans cluster algorithm following the tutorial in the Mahout In
Action book.

However, I need to know which text file was mapped to which cluster.

I did not find the easy way to do that. I ran the clusterdump algorithm ,
but I succeed only to get mapping from vector to cluster, and not from
Document to Cluster.

Any help is welcome!


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message