mahout-user mailing list archives

From vineet yadav <>
Subject Re: Incremental data stream clustering.
Date Tue, 01 Feb 2011 18:59:51 GMT
Hi Sarath,
In Mahout's k-means clustering, a sequence file of initial cluster centers is
passed as an argument, so you can run the k-means algorithm incrementally:
on each pass, supply the clusters computed in the previous pass as the
initial cluster centers. For better results, though, make sure the
documents/posts in each pass are related.
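
The warm-start idea above can be sketched in a few lines of plain Python.
This is not Mahout's API, only an illustration of the technique: the function
name kmeans_pass, the two toy batches, and the starting centers are all
made up for the example, and real posts would first have to be turned into
numeric vectors.

```python
from typing import List, Sequence

Point = Sequence[float]


def squared_distance(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pass(points: List[Point], centers: List[Point],
                iterations: int = 10) -> List[List[float]]:
    """Run Lloyd's k-means on one batch, starting from the given centers."""
    centers = [list(c) for c in centers]
    for _ in range(iterations):
        # Assignment step: group each point with its nearest center.
        groups: List[List[Point]] = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)),
                          key=lambda i: squared_distance(p, centers[i]))
            groups[nearest].append(p)
        # Update step: move each center to the mean of its group.
        # A center that attracts no points in this batch stays where it was.
        for i, group in enumerate(groups):
            if group:
                centers[i] = [sum(dim) / len(group) for dim in zip(*group)]
    return centers


# Hypothetical batches standing in for two successive fetches of a feed.
batch_1 = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
batch_2 = [(1.0, 0.0), (1.0, 1.0), (11.0, 10.0), (11.0, 11.0)]

# Assumed starting centers for the very first pass.
initial_centers = [(0.0, 0.0), (10.0, 10.0)]

# Pass 1 starts from the assumed centers; pass 2 starts from pass 1's output.
centers_after_1 = kmeans_pass(batch_1, initial_centers)
centers_after_2 = kmeans_pass(batch_2, centers_after_1)
print(centers_after_1)
print(centers_after_2)
```

Note that each pass in this sketch sees only its own batch, so earlier data
influences the result only through the starting centers. That is why
unrelated batches can pull the centers far away from the earlier clusters.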
Vineet Yadav

On Tue, Feb 1, 2011 at 11:58 PM, sharath jagannath <> wrote:

> Hey All,
> Another newbie to Mahout.
> I want to implement a system that clusters an incoming data stream.
> I went through the Mahout clustering tutorials, but I am still not sure how to
> handle the dynamic evolution of the clusters in Mahout.
> To be specific, I am trying to cluster the content from an RSS feed and am not
> sure how I should use Mahout to achieve it. Are Mahout's clustering
> algorithms incremental?
> I was looking for interfaces in Mahout, like Weka's incremental clustering, to
> achieve this, and I am lost :D.
> All help is much appreciated.
> Thanks,
> Sharath
