Message-ID: <4C082585.3090200@gmail.com>
Date: Thu, 03 Jun 2010 14:58:29 -0700
From: Tommy Chheng
MIME-Version: 1.0
To: user@mahout.apache.org
Subject: Re: mahout quickstart-kmeans script sequencefile parameter
References: <4C03E3AE.3010303@gmail.com> <4C05B636.1030502@gmail.com>
Content-Type: multipart/alternative; boundary="------------000003060202090808030606"

--------------000003060202090808030606
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit

Thanks Drew,

I started a new EC2 instance with the mahout trunk and got it working. There is a problem with the last line of the script, though.

The last line in the script gave an error:
../bin/mahout kmeans -i ./work/reuters-out-seqdir-sparse/tfidf/vectors/ -c ./work/clusters -o ./work/reuters-kmeans -k 20 -w
org.apache.commons.cli2.OptionException: Unexpected -w while processing Options

Removing the -w and adding --maxIter fixes it:
../bin/mahout kmeans -i ./work/reuters-out-seqdir-sparse/tfidf-vectors/ -c ./work/clusters -o ./work/reuters-kmeans -k 20 --maxIter 20

I added a comment to https://issues.apache.org/jira/browse/MAHOUT-390

@tommychheng
Programmer and UC Irvine Graduate Student
Find a great grad school based on research interests: http://gradschoolnow.com

On 6/2/10 8:27 PM, Drew Farris wrote:
> Very strange:
>
> drew@skirnir:~/mahout/svn-trunk$ svn info
> Path: .
> URL: https://svn.apache.org/repos/asf/mahout/trunk
> Repository Root: https://svn.apache.org/repos/asf
> Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
> Revision: 950859
> [...]
> drew@skirnir:~/mahout/svn-trunk$ ./bin/mahout seqdirectory -i ./work/reuters-out -o ./work/reuters-out-seqdir -c UTF-8
> no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
> [..]
> drew@skirnir:~/mahout/svn-trunk$ ls ./work/reuters-out-seqdir
> chunk-0
>
> To be absolutely certain nothing old is lurking in your target directories,
> try 'mvn clean install' to rebuild and see if your results differ. If you
> prefer, you can skip test execution with 'mvn clean install -DskipTests=true'.
>
> If that doesn't work, run 'mvn -v' and post the results -- that might
> provide some clues.
>
> - Drew
>
> On Tue, Jun 1, 2010 at 9:39 PM, Tommy Chheng wrote:
>
>> I updated from svn and did a mvn install, but I am still getting a
>> command-line parsing error on the seqdirectory command.
>> $ svn info
>> Path: .
>> URL: http://svn.apache.org/repos/asf/mahout/trunk
>> Repository Root: http://svn.apache.org/repos/asf
>> Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
>> Revision: 950329
>> Node Kind: directory
>> Schedule: normal
>> Last Changed Author: srowen
>> Last Changed Rev: 950049
>> Last Changed Date: 2010-06-01 05:55:49 -0700 (Tue, 01 Jun 2010)
>>
>>
>> $ ./bin/mahout seqdirectory -i ./work/reuters-out/ -o ./work/reuters-out-seqdir -c UTF-8
>> no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
>> Exception in thread "main" org.apache.commons.cli2.OptionException: Unexpected -i while processing Options
>>     at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
>>     at org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)
>>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>     at java.lang.reflect.Method.invoke(Method.java:597)
>>     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>>     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>>     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:174)
>>
>> @tommychheng
>> Programmer and UC Irvine Graduate Student
>> Find a great grad school based on research interests:
>> http://gradschoolnow.com
>>
>> On 6/1/10 12:43 PM, Grant Ingersoll wrote:
>>
>>> Can you try doing an SVN update and then "mvn install" and then run again?
>>>
>>> On May 31, 2010, at 12:28 PM, Tommy Chheng wrote:
>>>
>>> Hi,
>>>> I'm using the quickstart-kmeans.sh script from
>>>> https://issues.apache.org/jira/browse/MAHOUT-390 to run the example
>>>> kmeans. I'm on mahout trunk.
>>>>
>>>> It fails on the SequenceFile generation step:
>>>> $ ./bin/mahout seqdirectory -i ./work/reuters-out/ -o ./work/reuters-out-seqdir -c UTF-8
>>>> no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
>>>> Exception in thread "main" org.apache.commons.cli2.OptionException: Unexpected -i while processing Options
>>>>     at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
>>>>     at org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)
>>>>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>>>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>>>     at java.lang.reflect.Method.invoke(Method.java:597)
>>>>     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>>>>     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>>>>     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:174)
>>>>
>>>> Alternatively, I tried
>>>> ./bin/mahout seqdirectory --input ./work/reuters-out/ -o ./work/reuters-out-seqdir -c UTF-8
>>>> but I get the same "Unexpected --input" error.
>>>>
>>>>
>>>> --
>>>>
>>>> @tommychheng
>>>> Programmer and UC Irvine Graduate Student
>>>> Find a great grad school based on research interests:
>>>> http://gradschoolnow.com
>>>>

--------------000003060202090808030606--