Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8CB81C590 for ; Fri, 1 Jun 2012 20:15:01 +0000 (UTC) Received: (qmail 96360 invoked by uid 500); 1 Jun 2012 20:15:00 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 96313 invoked by uid 500); 1 Jun 2012 20:15:00 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 96304 invoked by uid 99); 1 Jun 2012 20:15:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Jun 2012 20:15:00 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [206.188.198.67] (HELO omr2pod1.networksolutionsemail.com) (206.188.198.67) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Jun 2012 20:14:51 +0000 Received: from cm-omr2pod1 (mailpod1.networksolutionsemail.com [206.188.198.65]) by omr2pod1.networksolutionsemail.com (8.13.8/8.13.8) with ESMTP id q51KEUiS029778 for ; Fri, 1 Jun 2012 16:14:30 -0400 Authentication-Results: cm-omr2pod1 smtp.user=jeastman@windwardsolutions.com; auth=pass (LOGIN) X-Authenticated-UID: jeastman@windwardsolutions.com Received: from [76.189.175.0] ([76.189.175.0:37635] helo=Jeffs-New-MacBook-Pro.local) by cm-omr2pod1 (envelope-from ) (ecelerity 2.2.2.41 r(31179/31189)) with ESMTPA id 96/15-25392-5A229CF4; Fri, 01 Jun 2012 16:14:30 -0400 Message-ID: <4FC922A5.3020000@windwardsolutions.com> Date: Fri, 01 Jun 2012 16:14:29 -0400 From: Jeff Eastman User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:12.0) Gecko/20120428 Thunderbird/12.0.1 MIME-Version: 1.0 To: user@mahout.apache.org Subject: Re: Bug report in FuzzyKMeansUtil References: In-Reply-To: Content-Type: multipart/mixed; boundary="------------040702050506090701010502" X-Virus-Checked: Checked by ClamAV on apache.org --------------040702050506090701010502 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Hi Yimin, Thanks for pointing this out. I've just committed a patch to correct it. On 6/1/12 3:13 PM, Yimin Wu wrote: > > > > Unlike KMeansUtil, the FuzzyKMeansUtil does not take a Configuration as a parameter. While performing deserialization, it create a new Configuration. This is a bug since it wouldn't work on Hadoop files. It will calles FuzzyMapper setup throw an exception complainning the file can not be found. I verified that the issue would gone after passing in conf like KMeansUtil. Bascially, this code patern "new Configuration()" could be problematic as long as the code would be used in a mapreduce job. I suggest developer team review all the places where a Configuration object is used. Whenever a new Configuration object is created instead of passed in. Thanks,Yimin Wu Predixion Software --------------040702050506090701010502--