mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robin Anil (JIRA)" <j...@apache.org>
Subject [jira] Commented: (MAHOUT-297) Canopy and Kmeans clustering slows down on using SeqAccVector for center
Date Tue, 27 Apr 2010 07:13:31 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861272#action_12861272
] 

Robin Anil commented on MAHOUT-297:
-----------------------------------

There was a discussion about this on the dev list. Check the util Vector Benchmarks and see
how much faster clustering became after this change.  Shouldnt necessarily be SeqAcc, if the
points are all dense vectors. But obvious savings for sparse data is much better than the
slight loss in performance for dense. (you will see that in the vector benchmarks code)



> Canopy and Kmeans clustering slows down on using SeqAccVector for center
> ------------------------------------------------------------------------
>
>                 Key: MAHOUT-297
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-297
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Clustering
>    Affects Versions: 0.4
>            Reporter: Robin Anil
>            Assignee: Robin Anil
>             Fix For: 0.4
>
>         Attachments: MAHOUT-297.patch, MAHOUT-297.patch, MAHOUT-297.patch, MAHOUT-297.patch,
MAHOUT-297.patch
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message