Return-Path: Delivered-To: apmail-mahout-user-archive@www.apache.org Received: (qmail 82037 invoked from network); 11 Feb 2011 08:47:38 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 11 Feb 2011 08:47:38 -0000 Received: (qmail 18317 invoked by uid 500); 11 Feb 2011 08:47:37 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 18003 invoked by uid 500); 11 Feb 2011 08:47:35 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 17995 invoked by uid 99); 11 Feb 2011 08:47:34 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Feb 2011 08:47:34 +0000 X-ASF-Spam-Status: No, hits=3.8 required=5.0 tests=FREEMAIL_FROM,FREEMAIL_REPLY,MIME_QP_LONG_LINE,RCVD_IN_BL_SPAMCOP_NET,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of tanish2k@gmail.com designates 209.85.161.42 as permitted sender) Received: from [209.85.161.42] (HELO mail-fx0-f42.google.com) (209.85.161.42) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Feb 2011 08:47:26 +0000 Received: by fxm11 with SMTP id 11so2523431fxm.1 for ; Fri, 11 Feb 2011 00:47:04 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:subject:references:from:content-type:x-mailer :in-reply-to:message-id:date:to:content-transfer-encoding :mime-version; bh=DafHXKiBcCLuIcmf0gKttzi7v8+GlrkbK0scOak9QLs=; b=kcjl0bcV8MU7Hh30LO4UK++SO7+6lWZJmhmIajXM1tU0NKA5hssnrHjhpYIrlhoKF6 euBnLzuMKjzKt++m0WchXpTA9/4mLRmtZgP6x4oG+eCKRJJYvjr3qrTfT+K9eFO5EXMi W9iyg/8DycIfT2kFlcUMUQ/9vG1E1kvC5XBpc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=subject:references:from:content-type:x-mailer:in-reply-to :message-id:date:to:content-transfer-encoding:mime-version; b=nQg+/fs8h6wPEpRYkBad84NPJEldgLK1pyI8ifZGmeLBS6F2fTpP9OJTml4snr6sOB PRaUiILFoIpKtSG+kHRc2okiEptwknbPDTTsV7LwPUYelqgojSIK/0IIRT/P3XkUeTX+ FhvxSoHT8wcxiDuqgMDGSt9t64fhtJCjxkqpw= Received: by 10.223.103.8 with SMTP id i8mr224336fao.47.1297414024844; Fri, 11 Feb 2011 00:47:04 -0800 (PST) Received: from [10.219.5.227] (tmo-107-170.customers.d1-online.com [80.187.107.170]) by mx.google.com with ESMTPS id f24sm211457fak.24.2011.02.11.00.47.02 (version=TLSv1/SSLv3 cipher=OTHER); Fri, 11 Feb 2011 00:47:03 -0800 (PST) Subject: Re: Oops: file URLs require file:// in some jobs References: From: Saurabh Arora Content-Type: text/plain; charset=us-ascii X-Mailer: iPhone Mail (8C148a) In-Reply-To: Message-Id: <1269111D-C1CF-4F28-BC8A-9D687D0A2A49@gmail.com> Date: Fri, 11 Feb 2011 09:47:52 +0100 To: "user@mahout.apache.org" Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (iPhone Mail 8C148a) try removing HADOOP_HOME from env.=20 -- Saurabh On Feb 11, 2011, at 6:20, Lance Norskog wrote: > Rather, non-distributed mode now does not work. bin/mahout always > tries to contact hdfs: >=20 > CLance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout > trainclassifier -o > file:///Users/laorskog/Documents/open/datasets/20news-bydate/bayes-model-b= grams > -i file:///Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-= train-input > -type cbayes -ng 2 > Running on hadoop, using > HADOOP_HOME=3D/Users/lancenorskog/Documents/open/hadoop-0.20.2 > No HADOOP_CONF_DIR set, using > /Users/lancenorskog/Documents/open/hadoop-0.20.2/conf > 11/02/10 21:17:15 INFO bayes.TrainClassifier: Training Complementary > Bayes Classifier > 11/02/10 21:17:15 INFO common.HadoopUtil: Deleting > file:/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-model= -bgrams > 11/02/10 21:17:15 INFO cbayes.CBayesDriver: Reading features... > 11/02/10 21:17:17 INFO ipc.Client: Retrying connect to server: > localhost/127.0.0.1:9000. Already tried 0 time(s). > 11/02/10 21:17:18 INFO ipc.Client: Retrying connect to server: > localhost/127.0.0.1:9000. Already tried 1 time(s). >=20 > [and a whole lot more, but you get the idea: i don't have HDFS up] >=20 >=20 >=20 > On Thu, Feb 10, 2011 at 9:14 PM, Lance Norskog wrote: >> This is new, within the last week. When I changed ~/Documents/* to >> file:///Users/lancenorskog/Documents/* this started working. >> Somehow, file paths without url protocol handlers don't default to >> file:// anymore. >>=20 >> Lance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout >> trainclassifier -o >> ~/Documents/open/datasets/20news-bydate/bayes-model-bgrams -i >> ~/Documents/open/datasets/20news-bydate/bayes-train-input -type cbayes >> -ng 2 >> Running on hadoop, using >> HADOOP_HOME=3D/Users/lancenorskog/Documents/open/hadoop-0.20.2 >> No HADOOP_CONF_DIR set, using >> /Users/lancenorskog/Documents/open/hadoop-0.20.2/conf >> 11/02/10 21:07:18 INFO bayes.TrainClassifier: Training Complementary >> Bayes Classifier >> 11/02/10 21:07:19 INFO cbayes.CBayesDriver: Reading features... >> 11/02/10 21:07:19 WARN mapred.JobClient: Use GenericOptionsParser for >> parsing the arguments. Applications should implement Tool for the >> same. >> Exception in thread "main" >> org.apache.hadoop.mapred.InvalidInputException: Input path does not >> exist: hdfs://localhost:9000/Users/lancenorskog/Documents/open/datasets/2= 0news-bydate/bayes-train-input >> at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFo= rmat.java:190) >> at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFor= mat.java:201) >> at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.jav= a:810) >> at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.= java:781) >> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730= ) >> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249) >>=20 >> -- >> Lance Norskog >> goksron@gmail.com >>=20 >=20 >=20 >=20 > --=20 > Lance Norskog > goksron@gmail.com