mahout-dev mailing list archives

From "Jeff Eastman (JIRA)" <>
Subject [jira] Updated: (MAHOUT-3) Build initial canopy clustering prototype
Date Sun, 17 Feb 2008 00:35:07 GMT


Jeff Eastman updated MAHOUT-3:

    Attachment: MAHOUT-3e.diff

This patch refactors the canopy configuration out of the configure methods of the various
mappers, combiners and reducers into a single static configure method on Canopy. I changed
DistanceMeasure creation from explicit tests to more generic class instance creation and added
a unit test for it. I also made DistanceMeasure extend JobConfigurable so that distance
measures can be configured. Configurability will allow me to create a
WeightedManhattanDistanceMeasure outside of the Mahout library. Making distance measures
job-configurable seems to increase their versatility at no cost to the library.
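
As a rough illustration of the idea (not the code in the attached patch), the sketch below creates a distance measure from its class name by reflection and then configures it. Everything here is an assumption made for the example: the Configurable interface is a stand-in for Hadoop's JobConfigurable and takes a plain Map rather than a JobConf, the "weights" key is hypothetical, and the method signatures are guesses.

```java
import java.util.HashMap;
import java.util.Map;

public class DistanceMeasureSketch {

    /** Hypothetical stand-in for Hadoop's JobConfigurable (the real one receives a JobConf). */
    interface Configurable {
        void configure(Map<String, String> conf);
    }

    /** Assumed shape of a configurable distance measure; the real interface may differ. */
    interface DistanceMeasure extends Configurable {
        double distance(double[] a, double[] b);
    }

    /** Plain Manhattan distance; it needs no configuration. */
    public static class ManhattanDistanceMeasure implements DistanceMeasure {
        public void configure(Map<String, String> conf) { }

        public double distance(double[] a, double[] b) {
            double sum = 0.0;
            for (int i = 0; i < a.length; i++) {
                sum += Math.abs(a[i] - b[i]);
            }
            return sum;
        }
    }

    /** Weighted variant that reads its weights from the hypothetical "weights" key. */
    public static class WeightedManhattanDistanceMeasure implements DistanceMeasure {
        private double[] weights = new double[0];

        public void configure(Map<String, String> conf) {
            String raw = conf.get("weights");
            if (raw == null || raw.isEmpty()) {
                return; // no weights configured: every dimension keeps weight 1.0
            }
            String[] parts = raw.split(",");
            weights = new double[parts.length];
            for (int i = 0; i < parts.length; i++) {
                weights[i] = Double.parseDouble(parts[i].trim());
            }
        }

        public double distance(double[] a, double[] b) {
            double sum = 0.0;
            for (int i = 0; i < a.length; i++) {
                double w = i < weights.length ? weights[i] : 1.0;
                sum += w * Math.abs(a[i] - b[i]);
            }
            return sum;
        }
    }

    /** Generic creation: no explicit test per known class name, just reflection plus configure. */
    static DistanceMeasure create(String className, Map<String, String> conf)
            throws ReflectiveOperationException {
        Class<?> cls = Class.forName(className);
        DistanceMeasure measure = (DistanceMeasure) cls.getDeclaredConstructor().newInstance();
        measure.configure(conf);
        return measure;
    }

    public static void main(String[] args) throws Exception {
        Map<String, String> conf = new HashMap<>();
        conf.put("weights", "2,1");
        DistanceMeasure measure =
                create(WeightedManhattanDistanceMeasure.class.getName(), conf);
        System.out.println(measure.distance(new double[] {0, 0}, new double[] {1, 3}));
    }
}
```

Because the factory only depends on the interface and a class name, a measure defined outside the library can be plugged in without changing the factory.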

Finally, all unit tests still run. The build.xml test target works for me too.

> Build initial canopy clustering prototype
> -----------------------------------------
>                 Key: MAHOUT-3
>                 URL:
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>            Reporter: Jeff Eastman
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.1
>         Attachments: MAHOUT-3.diff, MAHOUT-3a.diff, MAHOUT-3b.diff, MAHOUT-3c.diff, MAHOUT-3d.diff,
> I'd like to reserve some namespace, specifically org.apache.mahout.clustering.canopy
> to use for an initial prototype of canopy clustering. I'm going to start with a little unit
> test to get the basic algorithm sorted out, then M/R it.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
