mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lance Norskog <goks...@gmail.com>
Subject Re: Oops: file URLs require file:// in some jobs
Date Fri, 11 Feb 2011 05:20:00 GMT
Rather, non-distributed mode now does not work. bin/mahout always
tries to contact hdfs:

CLance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout
trainclassifier -o
file:///Users/laorskog/Documents/open/datasets/20news-bydate/bayes-model-bgrams
-i file:///Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input
-type cbayes -ng 2
Running on hadoop, using
HADOOP_HOME=/Users/lancenorskog/Documents/open/hadoop-0.20.2
No HADOOP_CONF_DIR set, using
/Users/lancenorskog/Documents/open/hadoop-0.20.2/conf
11/02/10 21:17:15 INFO bayes.TrainClassifier: Training Complementary
Bayes Classifier
11/02/10 21:17:15 INFO common.HadoopUtil: Deleting
file:/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-model-bgrams
11/02/10 21:17:15 INFO cbayes.CBayesDriver: Reading features...
11/02/10 21:17:17 INFO ipc.Client: Retrying connect to server:
localhost/127.0.0.1:9000. Already tried 0 time(s).
11/02/10 21:17:18 INFO ipc.Client: Retrying connect to server:
localhost/127.0.0.1:9000. Already tried 1 time(s).

[and a whole lot more, but you get the idea: i don't have HDFS up]



On Thu, Feb 10, 2011 at 9:14 PM, Lance Norskog <goksron@gmail.com> wrote:
> This is new, within the last week. When I changed ~/Documents/* to
> file:///Users/lancenorskog/Documents/* this started working.
> Somehow, file paths without url protocol handlers don't default to
> file:// anymore.
>
> Lance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout
> trainclassifier -o
> ~/Documents/open/datasets/20news-bydate/bayes-model-bgrams -i
> ~/Documents/open/datasets/20news-bydate/bayes-train-input -type cbayes
> -ng 2
> Running on hadoop, using
> HADOOP_HOME=/Users/lancenorskog/Documents/open/hadoop-0.20.2
> No HADOOP_CONF_DIR set, using
> /Users/lancenorskog/Documents/open/hadoop-0.20.2/conf
> 11/02/10 21:07:18 INFO bayes.TrainClassifier: Training Complementary
> Bayes Classifier
> 11/02/10 21:07:19 INFO cbayes.CBayesDriver: Reading features...
> 11/02/10 21:07:19 WARN mapred.JobClient: Use GenericOptionsParser for
> parsing the arguments. Applications should implement Tool for the
> same.
> Exception in thread "main"
> org.apache.hadoop.mapred.InvalidInputException: Input path does not
> exist: hdfs://localhost:9000/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input
>        at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
>        at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
>        at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
>        at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
>        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
>        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
>
> --
> Lance Norskog
> goksron@gmail.com
>



-- 
Lance Norskog
goksron@gmail.com

Mime
View raw message