Return-Path: X-Original-To: apmail-mahout-dev-archive@www.apache.org Delivered-To: apmail-mahout-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0D169745F for ; Sat, 16 Jul 2011 10:57:30 +0000 (UTC) Received: (qmail 96568 invoked by uid 500); 16 Jul 2011 10:57:29 -0000 Delivered-To: apmail-mahout-dev-archive@mahout.apache.org Received: (qmail 96367 invoked by uid 500); 16 Jul 2011 10:57:25 -0000 Mailing-List: contact dev-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mahout.apache.org Delivered-To: mailing list dev@mahout.apache.org Received: (qmail 96351 invoked by uid 99); 16 Jul 2011 10:57:22 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Jul 2011 10:57:22 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Jul 2011 10:57:20 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id D877B82CAE for ; Sat, 16 Jul 2011 10:56:59 +0000 (UTC) Date: Sat, 16 Jul 2011 10:56:59 +0000 (UTC) From: "Grant Ingersoll (JIRA)" To: dev@mahout.apache.org Message-ID: <1494703564.19765.1310813819883.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <2055084736.14138.1310664059869.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (MAHOUT-763) Map-Side Distance Comparison MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAHOUT-763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066401#comment-13066401 ] Grant Ingersoll commented on MAHOUT-763: ---------------------------------------- The code is more or less a copy of what's in KMeans for loading the Cluster objects. > Map-Side Distance Comparison > ---------------------------- > > Key: MAHOUT-763 > URL: https://issues.apache.org/jira/browse/MAHOUT-763 > Project: Mahout > Issue Type: New Feature > Reporter: Grant Ingersoll > Assignee: Grant Ingersoll > Priority: Minor > Fix For: 0.6 > > Attachments: MAHOUT-763.patch, MAHOUT-763.patch, MAHOUT-763.patch, MAHOUT-763.patch > > > KMeans currently on the map-side calculates the distance between a set of seeds and all other vectors. It would be handy to have a generalization of this that, given a set of vectors that fits in memory (the seeds) and other points, emit according to the distance measure. This is similar to the RowSimilarityJob, but much simpler and not as general purpose. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira