mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Benson Margulies <bimargul...@gmail.com>
Subject Reentering at the ground floor
Date Sat, 05 Mar 2011 20:03:15 GMT
I may have finally been handed a reason to make a serious attempt to
use mahout, and here I am more or less where I tried to start a very
long time ago.

Imagine that someone else has gone and stuck a large number of text
docs into a hadoop file system. I want to

a- convert them to feature vectors
b- run canopy+kmeans or some such clusterer
c- report back the assignment of docs to clusters

Where should I start reading in the web site?

Mime
View raw message