mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Christoph Hermann <>
Subject MeanShift Clustering duplicating vectors in canopies?
Date Mon, 25 Jan 2010 16:26:10 GMT

i'm running some clustering with the Mean Shift and in my final canopy i 
get 5x the same vector.

In the original input list i only had it once and i'm wondering why 
duplicates are allowed within the same canopy?

Attached is a file with the method i'm using to run mean shift as well 
as the ouput (i'm iterating over the getBoundPoints() list of the 

I'd be happy if someone could explain this.

Christoph Hermann

Christoph Hermann
Institut für Informatik
Tel: +49 761-203-8171 Fax: +49 761-203-8162

View raw message