Message-ID: <4C94F6D3.3040505@apache.org>
Date: Sat, 18 Sep 2010 19:28:51 +0200
From: Sebastian Schelter
Reply-To: ssc@apache.org
To: user@mahout.apache.org
Subject: Re: Evaluator for RecommenderJob (hadoop implementation)?
References: <1284685187681-1515638.post@n3.nabble.com>

When starting a new discussion on a mailing list, please do not reply to an
existing message; start a fresh email instead. Even if you change the subject
line of your email, other mail headers still track which thread you replied
to, so your question is "hidden" in that thread and gets less attention. It
also makes discussions in the mailing list archives particularly difficult
to follow.

See also: http://en.wikipedia.org/wiki/User:DonDiego/Thread_hijacking

> I am trying to run FPGrowth:
>
> hadoop jar /opt/mahout-0.3/mahout-examples-0.3.job org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver -i output/product/part-r-00000 -o pfp -method mapreduce -regex [\\t] -s 5 -g 17500 -k 50
>
> However, the 3rd task, "Processing FPTree: Bottom Up FP Growth > reduce",
> will not finish. It's basically stuck at 85% and hasn't budged in over an
> hour. The first task reported about 37K features, so I set -g to 17500.
> Does anyone know what's going on and how I can speed this up?
>
> Thanks