mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Grant Ingersoll (JIRA)" <>
Subject [jira] [Commented] (MAHOUT-763) Map-Side Distance Comparison
Date Sat, 16 Jul 2011 10:56:59 GMT


Grant Ingersoll commented on MAHOUT-763:

The code is more or less a copy of what's in KMeans for loading the Cluster objects.

> Map-Side Distance Comparison
> ----------------------------
>                 Key: MAHOUT-763
>                 URL:
>             Project: Mahout
>          Issue Type: New Feature
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>         Attachments: MAHOUT-763.patch, MAHOUT-763.patch, MAHOUT-763.patch, MAHOUT-763.patch
> KMeans currently on the map-side calculates the distance between a set of seeds and all
other vectors.  It would be handy to have a generalization of this that, given a set of vectors
that fits in memory (the seeds) and other points, emit <seed id, other id, distance>
according to the distance measure.  This is similar to the RowSimilarityJob, but much simpler
and not as general purpose.

This message is automatically generated by JIRA.
For more information on JIRA, see:


View raw message