mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Saikat Kanjilal (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning
Date Thu, 22 Mar 2012 05:40:22 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235364#comment-13235364
] 

Saikat Kanjilal commented on MAHOUT-984:
----------------------------------------

Paritosh,
I'm running into a strange issue, I've refactored the FuzzyKMeansDriver similar to KMeansDriver
and to use the FuzzyKMeansClusteringPolicy with the other logic being pretty much the same.
 The unit test for FuzzyKMeansDriver when run individually passes, however the unit test fails
when I go to run all the unit tests together.  I am attaching the clusterData function here,
any ideas on this?

Regards


  public static void clusterData(Path input,
                                 Path clustersIn,
                                 Path output,
                                 DistanceMeasure measure,
                                 double convergenceDelta,
                                 float m,
                                 boolean emitMostLikely,
                                 double threshold,
                                 boolean runSequential)
    throws IOException, ClassNotFoundException, InterruptedException {    
    if (log.isInfoEnabled()) {
        log.info("Running Clustering");
        log.info("Input: {} Clusters In: {} Out: {} Distance: {}", new Object[] {input, clustersIn,
output, measure});
      }
      ClusterClassifier.writePolicy(new FuzzyKMeansClusteringPolicy((double)m,convergenceDelta),
clustersIn);
      ClusterClassificationDriver.run(input, output, new Path(output, CLUSTERED_POINTS_DIRECTORY),
          threshold, true, runSequential);

  }

                
> Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning
> -----------------------------------------------------------------------------------
>
>                 Key: MAHOUT-984
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-984
>             Project: Mahout
>          Issue Type: Sub-task
>          Components: Clustering
>    Affects Versions: 0.6
>            Reporter: Paritosh Ranjan
>            Assignee: Paritosh Ranjan
>              Labels: clustering
>             Fix For: 0.7
>
>
> Use ClusterClassificationDriver to refactor clustering out of FuzzyKMeansDriver with
outlier pruning support.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message