mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lance Norskog <>
Subject Oops: file URLs require file:// in some jobs
Date Fri, 11 Feb 2011 05:14:27 GMT
This is new, within the last week. When I changed ~/Documents/* to
file:///Users/lancenorskog/Documents/* this started working.
Somehow, file paths without url protocol handlers don't default to
file:// anymore.

Lance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout
trainclassifier -o
~/Documents/open/datasets/20news-bydate/bayes-model-bgrams -i
~/Documents/open/datasets/20news-bydate/bayes-train-input -type cbayes
-ng 2
Running on hadoop, using
No HADOOP_CONF_DIR set, using
11/02/10 21:07:18 INFO bayes.TrainClassifier: Training Complementary
Bayes Classifier
11/02/10 21:07:19 INFO cbayes.CBayesDriver: Reading features...
11/02/10 21:07:19 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the
Exception in thread "main"
org.apache.hadoop.mapred.InvalidInputException: Input path does not
exist: hdfs://localhost:9000/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input
        at org.apache.hadoop.mapred.FileInputFormat.listStatus(
        at org.apache.hadoop.mapred.FileInputFormat.getSplits(
        at org.apache.hadoop.mapred.JobClient.writeOldSplits(
        at org.apache.hadoop.mapred.JobClient.submitJobInternal(
        at org.apache.hadoop.mapred.JobClient.submitJob(
        at org.apache.hadoop.mapred.JobClient.runJob(

Lance Norskog

View raw message