mahout-user mailing list archives

From Christoph Hermann <herm...@informatik.uni-freiburg.de>
Subject MeanShift Clustering duplicating vectors in canopies?
Date Mon, 25 Jan 2010 16:26:10 GMT
Hello,

I'm running some clustering with Mean Shift, and in my final canopy I
get the same vector five times.

In the original input list I only had it once, and I'm wondering why
duplicates are allowed within the same canopy.

Attached is a file with the method I'm using to run Mean Shift, as well
as the output (I'm iterating over the getBoundPoints() list of the
canopy).
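
For illustration only (this is not the attached file), here is a minimal sketch of how repeated entries in such a list could be counted. It assumes the bound points can be represented as plain double[] arrays; the class name DuplicateCheck and the method countOccurrences are hypothetical and not part of Mahout.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class DuplicateCheck {

    // Counts how often each point occurs, comparing points by value.
    // double[] is a hypothetical stand-in for the library's vector type.
    public static Map<String, Integer> countOccurrences(List<double[]> points) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (double[] p : points) {
            // double[] uses identity-based equals/hashCode, so build a
            // value-based key from the array contents instead.
            counts.merge(Arrays.toString(p), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up sample data: one point repeated five times, one unique point.
        List<double[]> boundPoints = new ArrayList<>();
        for (int i = 0; i < 5; i++) {
            boundPoints.add(new double[] {1.0, 2.0});
        }
        boundPoints.add(new double[] {3.0, 4.0});

        // Report only the points that occur more than once.
        for (Map.Entry<String, Integer> e : countOccurrences(boundPoints).entrySet()) {
            if (e.getValue() > 1) {
                System.out.println(e.getKey() + " occurs " + e.getValue() + " times");
            }
        }
    }
}
```

With the made-up sample data above, this reports that [1.0, 2.0] occurs 5 times, which is the kind of repetition I see in my output.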

I'd be happy if someone could explain this.

Regards,
Christoph Hermann

-- 
Christoph Hermann
Institut für Informatik
Tel: +49 761-203-8171 Fax: +49 761-203-8162
e-mail: hermann@informatik.uni-freiburg.de
