mahout-dev mailing list archives

From "Paritosh Ranjan (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAHOUT-940) Clusterdumper - Get rid of map based implementation
Date Wed, 04 Jan 2012 17:14:40 GMT
Clusterdumper - Get rid of map based implementation
---------------------------------------------------

                 Key: MAHOUT-940
                 URL: https://issues.apache.org/jira/browse/MAHOUT-940
             Project: Mahout
          Issue Type: Improvement
          Components: Clustering
    Affects Versions: 0.6
            Reporter: Paritosh Ranjan
             Fix For: 0.7


The current implementation of ClusterDumper puts the clusters and their related vectors in an
in-memory map. On large clustering outputs this generally results in an OutOfMemoryError (OOM).

Since ClusterOutputProcessor is available now, the ClusterDumper will first process the
clusteredPoints and then write the clusters to a local file.
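
To illustrate the idea only, here is a minimal sketch of a streaming dump that never holds
more than one point in memory. It does not use the Mahout API: the class StreamingDumpSketch,
the record type ClusteredPoint, and the "cluster-<id>.txt" file naming are all hypothetical
names chosen for this example, and the real change would read clusteredPoints from sequence
files instead of an in-memory iterator.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.Arrays;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class StreamingDumpSketch {

    /** Hypothetical stand-in for one (clusterId, vector) record of clusteredPoints. */
    public static final class ClusteredPoint {
        final int clusterId;
        final double[] vector;

        public ClusteredPoint(int clusterId, double[] vector) {
            this.clusterId = clusterId;
            this.vector = vector;
        }
    }

    /**
     * Appends each point to a per-cluster local file as soon as it is read,
     * instead of collecting all points in a Map of clusterId to list of vectors.
     * Only a small per-cluster counter is kept in memory.
     */
    public static Map<Integer, Long> dump(Iterator<ClusteredPoint> points, Path outDir)
            throws IOException {
        Files.createDirectories(outDir);
        Map<Integer, Long> counts = new LinkedHashMap<>();
        while (points.hasNext()) {
            ClusteredPoint p = points.next();
            Path file = outDir.resolve("cluster-" + p.clusterId + ".txt");
            // Reopening the file per point is slow but keeps the sketch simple.
            try (BufferedWriter w = Files.newBufferedWriter(
                    file, StandardOpenOption.CREATE, StandardOpenOption.APPEND)) {
                w.write(Arrays.toString(p.vector));
                w.newLine();
            }
            counts.merge(p.clusterId, 1L, Long::sum);
        }
        return counts;
    }

    public static void main(String[] args) throws IOException {
        List<ClusteredPoint> sample = Arrays.asList(
                new ClusteredPoint(1, new double[] {1.0, 2.0}),
                new ClusteredPoint(2, new double[] {9.0, 8.0}),
                new ClusteredPoint(1, new double[] {1.5, 2.5}));
        Path outDir = Files.createTempDirectory("clusterdump-sketch");
        Map<Integer, Long> counts = dump(sample.iterator(), outDir);
        System.out.println("Points per cluster: " + counts + ", written under " + outDir);
    }
}
```

Memory use in this sketch grows with the number of clusters (one counter each), not with the
number of points, which is the property the map based implementation lacks.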

Reports of ClusterDumper hitting OOM and therefore failing to read the clustering output appear
too often on the mailing list. This improvement will fix that problem.


        
