Subject: Re: How to launch an Hadoop Recommender Job from Java ?
From: Sean Owen
To: user@mahout.apache.org
Date: Wed, 10 Aug 2011 13:58:20 +0100

I don't believe it's actually cleaned out then. Hadoop thinks the temp
directory exists from a previous run, which perhaps failed. Make sure it is
deleted in HDFS. This is, at least, what the error is trying to tell you. Are
you running two jobs that might both want this directory?

2011/8/10 Clément Notin:

> Yes I agree it's ugly ;)
>
> I tried with the params
> "org.apache.mahout.cf.taste.hadoop.item.RecommenderJob
> -Dmapred.input.dir=mb-recouser-input/input.csv
> -Dmapred.output.dir=mb-recouser-output/reco.csv --numRecommendations 3
> --booleanData true --similarityClassname SIMILARITY_EUCLIDEAN_DISTANCE"
> (of course I split them).
>
> But I'm getting an error:
> INFO [2011-08-10 14:52:05,195] (JobClient.java:871) - Cleaning up the
> staging area
> file:/tmp/hadoop-clement/mapred/staging/clement1957523084/.staging/job_local_0001
> org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory
> temp/itemIDIndex already exists
>
> Even if I clean the /tmp/hadoop-clement/ folder beforehand...
> And it doesn't seem to run on the cluster.
>
> 2011/8/10 Sean Owen:
>
> > You could just run the main() method with an array of the same arguments
> > you passed on the command line. It's a little ugly but it works.
> >
> > 2011/8/10 Clément Notin:
> >
> > > Hello,
> > >
> > > I've managed to run a recommender over Hadoop using the command line
> > > /bin/mahout org.apache.mahout.cf.taste.hadoop.item.RecommenderJob --input
> > > .....
> > > I'm happy with it but now I want to launch this using Java.
> > >
> > > What is the easiest way to do this? I tried to run the MahoutDriver but
> > > it runs locally, whereas I want to launch the job on a Hadoop cluster.
> > >
> > > Regards.
> > >
> > > --
> > > Clément Notin
>
> --
> Clément Notin
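
For what it's worth, the two suggestions in this thread (pass main() the same
arguments as on the command line, and clear the stale temp directory first)
could be sketched roughly as below. This is only a sketch under assumptions:
the class name RecommenderLauncher and both helper methods are made up for
illustration, and the cleanup assumes "temp" resolved to a directory on the
local filesystem, which the job_local_0001 and file:/tmp/... entries in the
log seem to suggest. On a real cluster the directory would live in HDFS and
would have to be removed there instead. The hand-off to RecommenderJob is left
as a comment so the snippet compiles without Mahout or Hadoop on the classpath.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper class, not part of Mahout.
public class RecommenderLauncher {

    // Builds the argument array from the thread, one token per element.
    // The -D options come first, as in the original command line.
    static String[] buildArgs(String inputPath, String outputPath) {
        return new String[] {
            "-Dmapred.input.dir=" + inputPath,
            "-Dmapred.output.dir=" + outputPath,
            "--numRecommendations", "3",
            "--booleanData", "true",
            "--similarityClassname", "SIMILARITY_EUCLIDEAN_DISTANCE"
        };
    }

    // Deletes a directory tree on the LOCAL filesystem (assumption: the job
    // ran in local mode, so the stale temp directory is a local one).
    static void deleteRecursively(Path dir) throws IOException {
        if (!Files.exists(dir)) {
            return; // nothing left over from a previous run
        }
        try (Stream<Path> walk = Files.walk(dir)) {
            // Reverse order visits children before their parent directories.
            List<Path> paths = walk.sorted(Comparator.reverseOrder())
                                   .collect(Collectors.toList());
            for (Path p : paths) {
                Files.delete(p);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        // Directory to clear; defaults to the relative "temp" from the error.
        Path staleTemp = Paths.get(args.length > 0 ? args[0] : "temp");
        deleteRecursively(staleTemp);

        String[] jobArgs = buildArgs("mb-recouser-input/input.csv",
                                     "mb-recouser-output/reco.csv");

        // Hand-off point (needs Mahout and Hadoop on the classpath):
        // org.apache.mahout.cf.taste.hadoop.item.RecommenderJob.main(jobArgs);
        System.out.println(String.join(" ", jobArgs));
    }
}
```

Keeping each option and its value as separate array elements mirrors what the
shell does when it splits the command line, which is presumably what "of
course I split them" refers to above.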