Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 050F69EED for ; Tue, 3 Jan 2012 06:56:47 +0000 (UTC) Received: (qmail 38826 invoked by uid 500); 3 Jan 2012 06:56:45 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 38635 invoked by uid 500); 3 Jan 2012 06:56:36 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 38623 invoked by uid 99); 3 Jan 2012 06:56:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jan 2012 06:56:35 +0000 X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=FREEMAIL_REPLY,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ted.dunning@gmail.com designates 209.85.210.170 as permitted sender) Received: from [209.85.210.170] (HELO mail-iy0-f170.google.com) (209.85.210.170) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jan 2012 06:56:29 +0000 Received: by mail-iy0-f170.google.com with SMTP id j26so59846566iaf.1 for ; Mon, 02 Jan 2012 22:56:09 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=m0QB+8azJsJKehxpnQ+PbkHxMA41KpCbWiS2DdVX/6M=; b=g4/N3Pzd0OIwwBIQSIYU7l5iQJ2yxXETb5b1AaZXcfxIDlW7b1Jjh4S03pYFxq2aiJ vY14l5F2ctavaaGMpCQAfwVP0W3DwlO5pI0ojfXcGzZE35PA2Ey/SB5s/5vP9L01ODBg gzbj1USYVYwAnJ9xsE34HsnAzb9xaC3FeH3DE= Received: by 10.50.180.167 with SMTP id dp7mr61263610igc.26.1325573769106; Mon, 02 Jan 2012 22:56:09 -0800 (PST) MIME-Version: 1.0 Received: by 10.50.197.161 with HTTP; Mon, 2 Jan 2012 22:55:48 -0800 (PST) In-Reply-To: References: From: Ted Dunning Date: Mon, 2 Jan 2012 22:55:48 -0800 Message-ID: Subject: Re: STEPS(how) to write programs using mahout.. To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=14dae9340ce3a196a804b59a310b --14dae9340ce3a196a804b59a310b Content-Type: text/plain; charset=UTF-8 Did you look at the README? On Mon, Jan 2, 2012 at 10:10 PM, rahul raghavendhra < rahulraghavendhra1@gmail.com> wrote: > I have svn the trunk of mahout-snapshot and i installed using maven.. I > have tested Kmeans ans it works well.. How to run examples that are in > Mahout in Action book ? what are the steps to follow.. please help.. > > > > On Fri, Dec 30, 2011 at 3:02 AM, Ted Dunning > wrote: > > > Here are some sample maven projects that use mahout. You can copy the > > dependencies from the pom.xml file after you set up an empty project. Or > > you can copy this project and delete all the code before inserting your > > own. > > > > https://github.com/tdunning/Chapter-16 > > > > https://github.com/tdunning/pig-vector > > > > On Thu, Dec 29, 2011 at 12:25 PM, Dmitriy Lyubimov > >wrote: > > > > > 1) Are you sure you can't use Mahout command line? > > > > > > if no, try command line, otherwise proceed to #2. > > > > > > 2) Are you resolved to run it embedded client side? > > > > > > if no, go back to command line use. > > > if yes, your best bet is to build a maven project. Unfortunately i > > > cannot help you with maven references within framework of this list. I > > > think you need some maven resource to read thru how to build that. > > > > > > > > > 3) Are you also running MR backend-side with mahout dependencies as > well? > > > If yes, you need something called mahout-core-0.6-SNAPSHOT-job.jar (if > > > you build Mahout from source, it will land in core/target folder). > > > That's something called "hadoop job" jar which you can redistribute to > > > MR backend tasks. If that's what you want to do, try to ask on Hadoop > > > forums how to do it in your mapreduce-enabled applications, I am not > > > really 100% sure myself. Standard hadoop command takes those with > > > --jar option. > > > > > > 4) Sometimes it is also needed to do something of inverse nature: to > > > include some of _your_ libraries running in backend with Mahout tasks. > > > (example being: custom lucene text analyzer for text inputs). I think > > > it may be also achievable with mahout command line option by using the > > > same standard --jar option for your own hadoop job jar, but I am not > > > 100% sure. I did somethnig like that long ago but i can't remember how > > > it was done now. > > > > > > Thanks. > > > -Dmitriy > > > > > > On Thu, Dec 29, 2011 at 1:02 AM, rahul raghavendhra > > > wrote: > > > > It sound better.. can u please elaborate so that new uses like me can > > > > learn.. thanks Dmitry.. Please help.. thanks in advance > > > > > > > > ./rahul > > > > > > > > > > > > On Thu, Dec 29, 2011 at 2:07 PM, Dmitriy Lyubimov > > > > wrote: > > > > > > > >> > (I actually don't do that, I do it slightly > > > >> >other way, by publishing all dependency jars of my project on hdfs > > and > > > >> >then use DistributedCache to add them to my MR classpath, so i > don't > > > >> >know for sure about using mahout hadoop job jar outside the command > > > >> line). > > > >> >But command line is still probably the best way to try something, > > > >> >embedding takes more time. > > > >> > > > >> > > > > > > --14dae9340ce3a196a804b59a310b--