Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 803BD70EF for ; Sun, 2 Oct 2011 08:37:13 +0000 (UTC) Received: (qmail 31601 invoked by uid 500); 2 Oct 2011 08:37:12 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 31506 invoked by uid 500); 2 Oct 2011 08:37:11 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 31489 invoked by uid 99); 2 Oct 2011 08:37:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 02 Oct 2011 08:37:11 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [93.94.224.194] (HELO owa.exchange-login.net) (93.94.224.194) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 02 Oct 2011 08:37:03 +0000 Received: from HC2.hosted.exchange-login.net (93.94.224.201) by edge1.hosted.exchange-login.net (93.94.224.194) with Microsoft SMTP Server (TLS) id 14.1.339.1; Sun, 2 Oct 2011 10:36:43 +0200 Received: from [192.168.1.101] (182.68.179.156) by hc2.hosted.exchange-login.net (93.94.224.204) with Microsoft SMTP Server (TLS) id 14.1.339.1; Sun, 2 Oct 2011 10:36:41 +0200 Message-ID: <4E88228C.6040902@xebia.com> Date: Sun, 2 Oct 2011 14:06:28 +0530 From: Paritosh Ranjan User-Agent: Mozilla/5.0 (Windows NT 6.0; rv:7.0.1) Gecko/20110929 Thunderbird/7.0.1 MIME-Version: 1.0 To: Subject: Re: Difference in results : Clustering : sequential and MapReduce References: <4E876286.5040205@xebia.com> In-Reply-To: <4E876286.5040205@xebia.com> Content-Type: text/plain; charset="ISO-8859-1"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [182.68.179.156] X-Virus-Checked: Checked by ClamAV on apache.org Even run() of CanopyDriver, which takes only T1 and T2 is giving different results for sequential and mapreduce. This is preventing me from scaling up, as I need to run mapreduce on hadoop to scale. Is anyone having any idea of this problem? On 02-10-2011 00:27, Paritosh Ranjan wrote: > Hi, > > I am able to cluster correctly sequentially, using CanopyDriver. > > However, the same dataset, when processed as a MapReduce job, where ( > t1 = t3 and t2 = t4 and t1>t2) is not working. I am getting errors > like Canopies are empty. > > I also tried to reduce the values of t3 and t4. But reducing it either > has no effect or gives meaningless results. > > Am I doing something wrong? or is there a bug somewhere? > > I feel that both, sequential and MapReduce should give similar > results. But, It is not happening. > > Thanks and Regards, > Paritosh > > > ----- > No virus found in this message. > Checked by AVG - www.avg.com > Version: 10.0.1410 / Virus Database: 1520/3932 - Release Date: 10/01/11