mahout-dev mailing list archives

From "Daniel McEnnis (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-668) Adding knn support to Mahout classifiers
Date Sun, 22 May 2011 01:07:47 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037490#comment-13037490
] 

Daniel McEnnis commented on MAHOUT-668:
---------------------------------------

Ted,

You're right.  The distance metrics will have trouble with Random Vectors.  I'll work on a fix
for that.  (The code is on the critical path: I can't afford to lose the speed of the current
method, and the other vector methods give incorrect results for missing=0 vectors.)
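
For readers following the thread, here is a minimal sketch of the missing=0 point above.
It is not the code from the MAHOUT-668 patches and does not use Mahout's Vector API; the
class and method names are hypothetical, and it assumes that "Random Vectors" means
hash-backed sparse vectors whose entries have no guaranteed iteration order.  Under that
assumption, a distance that visits the union of the stored indices and treats any absent
index as 0 gives the same answer regardless of storage order, at some cost in speed.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeSet;

public class SparseDistanceSketch {

    // Squared Euclidean distance between two sparse vectors stored as
    // index -> value maps. An index missing from a map is treated as 0.
    static double squaredDistance(Map<Integer, Double> a, Map<Integer, Double> b) {
        // Union of stored indices; the sorted set makes summation order deterministic
        // even though HashMap iteration order is not.
        TreeSet<Integer> indices = new TreeSet<>(a.keySet());
        indices.addAll(b.keySet());

        double sum = 0.0;
        for (int i : indices) {
            double diff = a.getOrDefault(i, 0.0) - b.getOrDefault(i, 0.0);
            sum += diff * diff;
        }
        return sum;
    }

    public static void main(String[] args) {
        Map<Integer, Double> a = new HashMap<>();
        a.put(0, 1.0);
        a.put(5, 2.0);

        Map<Integer, Double> b = new HashMap<>();
        b.put(5, 4.0);
        b.put(9, 3.0);

        // (1-0)^2 + (2-4)^2 + (0-3)^2 = 14
        System.out.println(squaredDistance(a, b)); // prints 14.0
    }
}
```

A merge over two index-sorted entry lists would avoid the hash lookups, but it only works
when both vectors iterate in ascending index order.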

Daniel.

> Adding knn support to Mahout classifiers
> ----------------------------------------
>
>                 Key: MAHOUT-668
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-668
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>    Affects Versions: 0.6
>            Reporter: Daniel McEnnis
>              Labels: classification, knn
>         Attachments: MAHOUT-668.pat, Mahout-668-2.patch, Mahout-668-3.patch, Mahout-668.pat
>
>   Original Estimate: 672h
>  Remaining Estimate: 672h
>
> Initial implementation of knn.  This is a minimum base set with many more possible
> add-ons, including support for text and Weka input as well as a classify-only (no confusion
> matrix) back end.  The system was tested on the 20 Newsgroups data set.
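
As a rough illustration of what a classify-only knn step involves, the sketch below labels
a query point by majority vote among its k nearest training examples.  It is a generic
textbook version on dense arrays, not the implementation attached to this issue; the class
name, the Example type, and the choice of squared Euclidean distance are all assumptions
made for the example.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class KnnSketch {

    // A labelled training point (hypothetical type, for illustration only).
    static final class Example {
        final double[] features;
        final String label;

        Example(double[] features, String label) {
            this.features = features;
            this.label = label;
        }
    }

    // Squared Euclidean distance; assumes both arrays have the same length.
    static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            sum += diff * diff;
        }
        return sum;
    }

    // Returns the majority label among the k nearest training examples,
    // or null if there are none. Ties go to the label that reaches the
    // winning count first when scanning from nearest to farthest.
    static String classify(List<Example> training, double[] query, int k) {
        List<Example> sorted = new ArrayList<>(training);
        sorted.sort(Comparator.comparingDouble(
                (Example e) -> squaredDistance(e.features, query)));

        Map<String, Integer> votes = new HashMap<>();
        String best = null;
        int bestCount = 0;
        for (Example e : sorted.subList(0, Math.min(k, sorted.size()))) {
            int count = votes.merge(e.label, 1, Integer::sum);
            if (count > bestCount) {
                bestCount = count;
                best = e.label;
            }
        }
        return best;
    }
}
```

Sorting the whole training set costs O(n log n) per query; a bounded heap of size k would
be the usual refinement when n is large.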

