mahout-user mailing list archives

From Jeff Eastman
Subject Re: online clustering with mahout
Date Mon, 14 May 2012 13:20:56 GMT
Look at ClusterIterator.iterate(). It does clustering in memory
without any Hadoop. ClusterIterator.iterateSeq does clustering in a
single process from/to Hadoop sequence files but without map/reduce.
ClusterIterator.iterateMR uses full Hadoop to do clustering for the same
algorithms (k-means, fuzzy k-means, Dirichlet), all configured using
ClusteringPolicy instances.
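
To give a feel for what the in-memory variant amounts to, here is a minimal stand-alone sketch of a fixed number of k-means passes over data held in memory. It is plain Java and deliberately does not use Mahout: the class InMemoryKMeans and its methods are hypothetical names made up for this illustration, not the ClusterIterator API, and the real code works on Mahout Vector objects with the update rule supplied by a ClusteringPolicy rather than hard-coded as it is here.

```java
import java.util.Arrays;

// Hypothetical illustration only; this is NOT Mahout's ClusterIterator.
public class InMemoryKMeans {

    // Squared Euclidean distance between two points of equal dimension.
    static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    // Index of the center closest to the given point.
    static int nearest(double[] point, double[][] centers) {
        int best = 0;
        for (int c = 1; c < centers.length; c++) {
            if (squaredDistance(point, centers[c]) < squaredDistance(point, centers[best])) {
                best = c;
            }
        }
        return best;
    }

    // Runs numIterations k-means passes over data that is entirely in memory.
    static double[][] iterate(double[][] data, double[][] initialCenters, int numIterations) {
        int k = initialCenters.length;
        int dim = initialCenters[0].length;
        double[][] centers = new double[k][];
        for (int c = 0; c < k; c++) {
            centers[c] = initialCenters[c].clone();
        }

        for (int iter = 0; iter < numIterations; iter++) {
            double[][] sums = new double[k][dim];
            int[] counts = new int[k];
            // Assignment step: add each point to its nearest cluster's running sum.
            for (double[] point : data) {
                int c = nearest(point, centers);
                counts[c]++;
                for (int i = 0; i < dim; i++) {
                    sums[c][i] += point[i];
                }
            }
            // Update step: move each non-empty cluster's center to the mean of its points.
            for (int c = 0; c < k; c++) {
                if (counts[c] == 0) {
                    continue; // an empty cluster keeps its previous center
                }
                for (int i = 0; i < dim; i++) {
                    centers[c][i] = sums[c][i] / counts[c];
                }
            }
        }
        return centers;
    }

    public static void main(String[] args) {
        double[][] data = {{1, 1}, {1, 2}, {2, 1}, {8, 8}, {9, 8}, {8, 9}};
        double[][] start = {{0, 0}, {10, 10}};
        System.out.println(Arrays.deepToString(iterate(data, start, 5)));
    }
}
```

The whole loop lives in one process and touches no files, which is the property the in-memory path offers. The sequence-file and map/reduce variants differ in where the data is read from and how the passes are distributed, not in the algorithms themselves.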

On 5/14/12 8:34 AM, Ioan Eugen Stan wrote:
> Hi,
> Does Mahout offer online clustering out of the box using sequential
> clustering (no MapReduce)? I'm looking over the code (trunk) and I
> found ClusterClassifier but I can't figure out how that works. Any
> examples or more docs on this topic?
> Thanks,
