Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 887B4E216 for ; Thu, 17 Jan 2013 16:10:09 +0000 (UTC) Received: (qmail 5962 invoked by uid 500); 17 Jan 2013 16:10:08 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 5687 invoked by uid 500); 17 Jan 2013 16:10:07 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 5659 invoked by uid 99); 17 Jan 2013 16:10:06 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 Jan 2013 16:10:06 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of pat.ferrel@gmail.com designates 209.85.210.53 as permitted sender) Received: from [209.85.210.53] (HELO mail-da0-f53.google.com) (209.85.210.53) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 Jan 2013 16:09:56 +0000 Received: by mail-da0-f53.google.com with SMTP id x6so1169702dac.26 for ; Thu, 17 Jan 2013 08:09:35 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=eVMFEAUibQg3W43nUbetfcne11DiL6nqcqPww1YPt5g=; b=H9MN685Lt3Q6MHTdJtwI23OkqvL9eNOhzjz6LLyeNoj+tci9cSlzJuXk1urCQ9WXGF B20z/eSOSdzttyqTZ0tg5PL7fewluxSlTujijegzhAMdrcyoMohTAZ5FbNxYgAgi4nfI S7/23Lmsqahs8P264anfRNvyOxIaiuiVNUztRBrPpRxumVVVJ6p5jpfZgAgRafSZEmo0 XIp07svXzG8gIq0dFWe218mMjqzE1K78zsNKxbmV5kHUN+6QeKpMWfGP0WIVGF4ReWEX Ap6/SgSZ4GvYp960L+o2LHZKFe+/UBZmREwfm66GJjOhiRm8jNzN11x4QYVFmWyJmt/0 3QFQ== X-Received: by 10.66.78.168 with SMTP id c8mr15010049pax.16.1358438975154; Thu, 17 Jan 2013 08:09:35 -0800 (PST) Received: from [10.7.1.27] ([72.5.59.176]) by mx.google.com with ESMTPS id ho4sm1294343pbc.54.2013.01.17.08.09.31 (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Thu, 17 Jan 2013 08:09:32 -0800 (PST) Content-Type: text/plain; charset=iso-8859-1 Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\)) Subject: Re: is Hadoop based SVD_ALS a complete feature? From: Pat Ferrel In-Reply-To: <50F814E5.3010206@apache.org> Date: Thu, 17 Jan 2013 08:09:35 -0800 Content-Transfer-Encoding: quoted-printable Message-Id: <67E2FED0-62F1-486A-8D92-9F92939362BF@gmail.com> References: <50F814E5.3010206@apache.org> To: user@mahout.apache.org, ssc@apache.org X-Mailer: Apple Mail (2.1499) X-Virus-Checked: Checked by ClamAV on apache.org There is a problem in factorize-movielens-1M.sh and the DataSplitter = needs to initialize the args parser before it accesses the options ( I = think I put a ticket in for the DataSplitter with a patch). The last = problem below is Ying Liao's alone. On Jan 17, 2013, at 7:12 AM, Sebastian Schelter wrote: Which version/distribution of Hadoop are you using? On 17.01.2013 16:08, Pat Ferrel wrote: > +1 this, found the same problems, same fixes. Haven't seem your last = problem >=20 > On Jan 11, 2013, at 1:41 PM, Ying Liao wrote: >=20 > I am tring factorize-movielens-1M.sh. I first find a bug in the sh = file. > Then I find a bug in = org.apache.mahout.cf.taste.hadoop.als.DatasetSplitter, > the argMap is not mapped. No I hit a third bug: > [cloudera@localhost trunk]$ hadoop jar > = /home/cloudera/workspace/Mahout/trunk/examples/target/mahout-examples-0.8-= SNAPSHOT-job.jar > org.apache.mahout.cf.taste.hadoop.als.DatasetSplitter --input > /user/cloudera/ratings.csv --output /user/cloudera/dataset > --trainingPercentage 0.9 --probePercentage 0.1 --tempDir > /user/cloudera/dataset/tmp > 13/01/11 16:37:30 INFO common.AbstractJob: Command line arguments: > {--endPhase=3D[2147483647], --input=3D[/user/cloudera/ratings.csv], > --output=3D[/user/cloudera/dataset], --probePercentage=3D[0.1], > --startPhase=3D[0], --tempDir=3D[/user/cloudera/dataset/tmp], > --trainingPercentage=3D[0.9]} > 13/01/11 16:37:30 WARN conf.Configuration: mapred.input.dir is = deprecated. > Instead, use mapreduce.input.fileinputformat.inputdir > 13/01/11 16:37:30 WARN conf.Configuration: mapred.compress.map.output = is > deprecated. Instead, use mapreduce.map.output.compress > 13/01/11 16:37:30 WARN conf.Configuration: mapred.output.dir is = deprecated. > Instead, use mapreduce.output.fileoutputformat.outputdir > Exception in thread "main" java.lang.IncompatibleClassChangeError: = Found > interface org.apache.hadoop.mapreduce.JobContext, but class was = expected > at = org.apache.mahout.common.HadoopUtil.getCustomJobName(HadoopUtil.java:166) > at = org.apache.mahout.common.AbstractJob.prepareJob(AbstractJob.java:553) > at > = org.apache.mahout.cf.taste.hadoop.als.DatasetSplitter.run(DatasetSplitter.= java:85) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) > at > = org.apache.mahout.cf.taste.hadoop.als.DatasetSplitter.main(DatasetSplitter= .java:62) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > = sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:= 39) > at > = sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorIm= pl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) >=20 > Any help is appreciated. >=20 > Thanks, > Ying >=20