Date: Thu, 22 Mar 2012 05:40:22 +0000 (UTC)
From: "Saikat Kanjilal (Commented) (JIRA)"
To: dev@mahout.apache.org
Message-ID: <1209571038.1228.1332394822753.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1092325952.8944.1329983869178.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

[
https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235364#comment-13235364 ]

Saikat Kanjilal commented on MAHOUT-984:
----------------------------------------

Paritosh, I'm running into a strange issue. I've refactored FuzzyKMeansDriver along the lines of KMeansDriver so that it uses FuzzyKMeansClusteringPolicy; the rest of the logic is pretty much the same. The unit test for FuzzyKMeansDriver passes when run on its own but fails when I run all the unit tests together. I am attaching the clusterData function here. Any ideas on this?

Regards

  public static void clusterData(Path input,
                                 Path clustersIn,
                                 Path output,
                                 DistanceMeasure measure,
                                 double convergenceDelta,
                                 float m,
                                 boolean emitMostLikely,
                                 double threshold,
                                 boolean runSequential)
      throws IOException, ClassNotFoundException, InterruptedException {
    if (log.isInfoEnabled()) {
      log.info("Running Clustering");
      log.info("Input: {} Clusters In: {} Out: {} Distance: {}",
               new Object[] {input, clustersIn, output, measure});
    }
    ClusterClassifier.writePolicy(new FuzzyKMeansClusteringPolicy((double) m, convergenceDelta), clustersIn);
    ClusterClassificationDriver.run(input, output, new Path(output, CLUSTERED_POINTS_DIRECTORY),
                                    threshold, true, runSequential);
  }

> Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning
> -----------------------------------------------------------------------------------
>
>                 Key: MAHOUT-984
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-984
>             Project: Mahout
>          Issue Type: Sub-task
>          Components: Clustering
>    Affects Versions: 0.6
>            Reporter: Paritosh Ranjan
>            Assignee: Paritosh Ranjan
>              Labels: clustering
>             Fix For: 0.7
>
>
> Use ClusterClassificationDriver to refactor clustering out of FuzzyKMeansDriver with outlier pruning support.
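
For context on the m and threshold arguments that clusterData passes along, here is a minimal sketch of textbook fuzzy k-means membership followed by threshold-based pruning. It is not Mahout's implementation: the class FuzzyMembershipSketch and both method names are hypothetical, and the sketch assumes threshold is a minimum membership score below which a point is not assigned to a cluster.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only; every name in this class is hypothetical, not part of Mahout. */
public class FuzzyMembershipSketch {

  /**
   * Textbook fuzzy k-means membership of one point in each of k clusters:
   * u_i = 1 / sum_j (d_i / d_j)^(2 / (m - 1)), where d_i is the distance to centroid i.
   */
  public static double[] memberships(double[] distances, double m) {
    if (m <= 1.0) {
      throw new IllegalArgumentException("fuzziness m must be greater than 1");
    }
    int k = distances.length;
    double[] u = new double[k];
    // A point lying exactly on a centroid belongs fully to that cluster.
    for (int i = 0; i < k; i++) {
      if (distances[i] == 0.0) {
        u[i] = 1.0;
        return u;
      }
    }
    double exponent = 2.0 / (m - 1.0);
    for (int i = 0; i < k; i++) {
      double sum = 0.0;
      for (int j = 0; j < k; j++) {
        sum += Math.pow(distances[i] / distances[j], exponent);
      }
      u[i] = 1.0 / sum;
    }
    return u;
  }

  /** Assumed pruning rule: keep only cluster indices whose membership reaches the threshold. */
  public static List<Integer> clustersAboveThreshold(double[] u, double threshold) {
    List<Integer> kept = new ArrayList<>();
    for (int i = 0; i < u.length; i++) {
      if (u[i] >= threshold) {
        kept.add(i);
      }
    }
    return kept;
  }

  public static void main(String[] args) {
    // One point at distance 1 from the first centroid and 3 from the second, with m = 2.
    double[] u = memberships(new double[] {1.0, 3.0}, 2.0);
    System.out.println(u[0] + " " + u[1]);
    System.out.println(clustersAboveThreshold(u, 0.5));
  }
}
```

A larger m flattens the memberships toward 1/k, so a fixed threshold prunes more points as m grows; a point whose memberships all fall below the threshold is treated as an outlier under the assumption stated above.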