Return-Path:
X-Original-To: apmail-commons-issues-archive@minotaur.apache.org
Delivered-To: apmail-commons-issues-archive@minotaur.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3])
    by minotaur.apache.org (Postfix) with SMTP id 4B1C4F4F2
    for ; Sat, 23 Mar 2013 15:29:18 +0000 (UTC)
Received: (qmail 93650 invoked by uid 500); 23 Mar 2013 15:29:15 -0000
Delivered-To: apmail-commons-issues-archive@commons.apache.org
Received: (qmail 93533 invoked by uid 500); 23 Mar 2013 15:29:15 -0000
Mailing-List: contact issues-help@commons.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: issues@commons.apache.org
Delivered-To: mailing list issues@commons.apache.org
Received: (qmail 93443 invoked by uid 99); 23 Mar 2013 15:29:15 -0000
Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28)
    by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 23 Mar 2013 15:29:15 +0000
Date: Sat, 23 Mar 2013 15:29:15 +0000 (UTC)
From: "Thomas Neidhart (JIRA)"
To: issues@commons.apache.org
Message-ID:
In-Reply-To:
References:
Subject: [jira] [Updated] (MATH-917) More distance measurements are needed in o.a.c.m.stat.clustering.
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394


     [ https://issues.apache.org/jira/browse/MATH-917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Neidhart updated MATH-917:
---------------------------------

    Attachment: clustering.zip

I have attached the result of some refactoring of the cluster package.
It includes the following changes/additions:

 * Move the distance calculation from the Clusterable interface to a dedicated DistanceMeasure interface, with a first concrete implementation: EuclideanDistance
 * Modify Cluster to make it more general: remove the center, as it is only used by centroid-based clustering algorithms
 * Introduce a Clusterer interface with currently only one method, cluster(Collection); we may add more, e.g. one with a maxIterations argument
 * Add an AbstractClusterer class that provides the basics every clusterer needs, e.g. a distance measure
 * The existing clustering algorithms implement the new interface via the abstract class

I like the Clusterable interface, as it makes it quite easy to extend existing data objects so that they can serve as input for the clusterer.
The simple *Point implementations have been kept, but I am not fully happy with the name.

I would like feedback on whether this goes in the right direction; if so, I will finish the contribution.

> More distance measurements are needed in o.a.c.m.stat.clustering.
> -----------------------------------------------------------------
>
>                 Key: MATH-917
>                 URL: https://issues.apache.org/jira/browse/MATH-917
>             Project: Commons Math
>          Issue Type: Improvement
>            Reporter: Reid Hochstedler
>             Fix For: 4.0
>
>         Attachments: clustering.zip
>
>
> Currently only Euclidean distance is used for distance measurement; it would be easy to quickly add Manhattan and Chebyshev distance, among others.
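
To illustrate the idea of a pluggable distance measure discussed in this message, here is a minimal Java sketch. It is not taken from clustering.zip. Only the names DistanceMeasure and EuclideanDistance appear in the comment; the method signature compute(double[], double[]), the ManhattanDistance and ChebyshevDistance classes, and the DistanceSketch demo class are assumptions made for this example.

```java
// Hypothetical sketch: signatures and the Manhattan/Chebyshev classes are
// assumptions for illustration, not the contents of the attached patch.

/** Computes a distance between two points given as coordinate arrays of equal length. */
interface DistanceMeasure {
    double compute(double[] a, double[] b);
}

/** Euclidean (L2) distance: square root of the sum of squared differences. */
class EuclideanDistance implements DistanceMeasure {
    @Override
    public double compute(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }
}

/** Manhattan (L1) distance: sum of absolute coordinate differences. */
class ManhattanDistance implements DistanceMeasure {
    @Override
    public double compute(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += Math.abs(a[i] - b[i]);
        }
        return sum;
    }
}

/** Chebyshev (L-infinity) distance: largest absolute coordinate difference. */
class ChebyshevDistance implements DistanceMeasure {
    @Override
    public double compute(double[] a, double[] b) {
        double max = 0.0;
        for (int i = 0; i < a.length; i++) {
            max = Math.max(max, Math.abs(a[i] - b[i]));
        }
        return max;
    }
}

/** Small demo: the same pair of points measured with each implementation. */
class DistanceSketch {
    public static void main(String[] args) {
        double[] p = {0.0, 0.0};
        double[] q = {3.0, 4.0};
        DistanceMeasure[] measures = {
            new EuclideanDistance(), new ManhattanDistance(), new ChebyshevDistance()
        };
        for (DistanceMeasure m : measures) {
            System.out.println(m.getClass().getSimpleName() + ": " + m.compute(p, q));
        }
    }
}
```

With the calculation behind an interface like this, a clusterer would only need to hold a DistanceMeasure reference, so adding a new measure would not require touching the Clusterable data objects.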