mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From jamal sasha <jamalsha...@gmail.com>
Subject error running synthetic-cluster example script
Date Mon, 28 Jan 2013 23:00:15 GMT
Hi,
  I tried to run the cluster-syntheticcontrol.sh script inside examples/bin
I am getting this error?
mhduser@markov:/usr/local/mahout-distribution-0.7$
examples/bin/cluster-syntheticcontrol.sh
Please select a number to choose the corresponding clustering algorithm
1. canopy clustering
2. kmeans clustering
3. fuzzykmeans clustering
4. dirichlet clustering
5. meanshift clustering
Enter your choice : 3
ok. You chose 3 and we'll use fuzzykmeans Clustering
creating work directory at /tmp/mahout-work-mhduser
Downloading Synthetic control data
examples/bin/cluster-syntheticcontrol.sh: line 58: curl: command not found
Checking the health of DFS...
Warning: $HADOOP_HOME is deprecated.

Found 9 items
drwxr-xr-x   - mhduser supergroup          0 2013-01-18 15:32
/user/mhduser/counts
drwxr-xr-x   - mhduser supergroup          0 2013-01-21 13:17
/user/mhduser/final-output
drwxr-xr-x   - mhduser supergroup          0 2013-01-21 15:21
/user/mhduser/input
drwxr-xr-x   - mhduser supergroup          0 2013-01-21 15:23
/user/mhduser/mod-count-output
drwxr-xr-x   - mhduser supergroup          0 2013-01-24 17:07
/user/mhduser/movie-lens
drwxr-xr-x   - mhduser supergroup          0 2013-01-18 14:16
/user/mhduser/output
drwxr-xr-x   - mhduser supergroup          0 2013-01-18 14:39
/user/mhduser/temp-output
drwxr-xr-x   - mhduser supergroup          0 2013-01-28 14:12
/user/mhduser/test-hclustering
drwxr-xr-x   - mhduser supergroup          0 2013-01-28 14:33
/user/mhduser/testdata
DFS is healthy...
Uploading Synthetic control data to HDFS
Warning: $HADOOP_HOME is deprecated.

Deleted hdfs://localhost:54310/user/mhduser/testdata
Warning: $HADOOP_HOME is deprecated.

Warning: $HADOOP_HOME is deprecated.

put: File /tmp/mahout-work-mhduser/synthetic_control.data does not exist.
Successfully Uploaded Synthetic control data to HDFS
Warning: $HADOOP_HOME is deprecated.

Running on hadoop, using /usr/local/hadoop/bin/hadoop and HADOOP_CONF_DIR=
MAHOUT-JOB:
/usr/local/mahout-distribution-0.7/examples/target/mahout-examples-0.7-job.jar
Warning: $HADOOP_HOME is deprecated.

13/01/28 14:36:19 WARN driver.MahoutDriver: No
org.apache.mahout.clustering.syntheticcontrol.fuzzykmeans.Job.props found
on classpath, will use command-line arguments only
13/01/28 14:36:19 INFO fuzzykmeans.Job: Running with default arguments
13/01/28 14:36:19 INFO common.HadoopUtil: Deleting output
13/01/28 14:36:19 INFO fuzzykmeans.Job: Preparing Input
13/01/28 14:36:20 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
13/01/28 14:36:20 INFO input.FileInputFormat: Total input paths to process
: 0
13/01/28 14:36:20 INFO mapred.JobClient: Running job: job_201301281401_0001
13/01/28 14:36:21 INFO mapred.JobClient:  map 0% reduce 0%
13/01/28 14:36:38 INFO mapred.JobClient: Job complete: job_201301281401_0001
13/01/28 14:36:38 INFO mapred.JobClient: Counters: 4
13/01/28 14:36:38 INFO mapred.JobClient:   Job Counters
13/01/28 14:36:38 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=8136
13/01/28 14:36:38 INFO mapred.JobClient:     Total time spent by all
reduces waiting after reserving slots (ms)=0
13/01/28 14:36:38 INFO mapred.JobClient:     Total time spent by all maps
waiting after reserving slots (ms)=0
13/01/28 14:36:38 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=0
13/01/28 14:36:38 INFO fuzzykmeans.Job: Running Canopy to get initial
clusters
13/01/28 14:36:38 INFO canopy.CanopyDriver: Build Clusters Input:
output/data Out: output/canopies Measure:
org.apache.mahout.common.distance.EuclideanDistanceMeasure@309fe84e t1:
80.0 t2: 55.0
13/01/28 14:36:38 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
13/01/28 14:36:39 INFO input.FileInputFormat: Total input paths to process
: 0
13/01/28 14:36:39 INFO mapred.JobClient: Running job: job_201301281401_0002
13/01/28 14:36:40 INFO mapred.JobClient:  map 0% reduce 0%
13/01/28 14:36:58 INFO mapred.JobClient:  map 0% reduce 100%
13/01/28 14:37:03 INFO mapred.JobClient: Job complete: job_201301281401_0002
13/01/28 14:37:03 INFO mapred.JobClient: Counters: 19
13/01/28 14:37:03 INFO mapred.JobClient:   Job Counters
13/01/28 14:37:03 INFO mapred.JobClient:     Launched reduce tasks=1
13/01/28 14:37:03 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=7434
13/01/28 14:37:03 INFO mapred.JobClient:     Total time spent by all
reduces waiting after reserving slots (ms)=0
13/01/28 14:37:03 INFO mapred.JobClient:     Total time spent by all maps
waiting after reserving slots (ms)=0
13/01/28 14:37:03 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=7103
13/01/28 14:37:03 INFO mapred.JobClient:   File Output Format Counters
13/01/28 14:37:03 INFO mapred.JobClient:     Bytes Written=106
13/01/28 14:37:03 INFO mapred.JobClient:   FileSystemCounters
13/01/28 14:37:03 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=22749
13/01/28 14:37:03 INFO mapred.JobClient:     HDFS_BYTES_WRITTEN=106
13/01/28 14:37:03 INFO mapred.JobClient:   Map-Reduce Framework
13/01/28 14:37:03 INFO mapred.JobClient:     Reduce input groups=0
13/01/28 14:37:03 INFO mapred.JobClient:     Combine output records=0
13/01/28 14:37:03 INFO mapred.JobClient:     Reduce shuffle bytes=0
13/01/28 14:37:03 INFO mapred.JobClient:     Physical memory (bytes)
snapshot=100974592
13/01/28 14:37:03 INFO mapred.JobClient:     Reduce output records=0
13/01/28 14:37:03 INFO mapred.JobClient:     Spilled Records=0
13/01/28 14:37:03 INFO mapred.JobClient:     CPU time spent (ms)=670
13/01/28 14:37:03 INFO mapred.JobClient:     Total committed heap usage
(bytes)=91619328
13/01/28 14:37:03 INFO mapred.JobClient:     Virtual memory (bytes)
snapshot=1318285312
13/01/28 14:37:03 INFO mapred.JobClient:     Combine input records=0
13/01/28 14:37:03 INFO mapred.JobClient:     Reduce input records=0
13/01/28 14:37:03 INFO fuzzykmeans.Job: Running FuzzyKMeans
Exception in thread "main" java.lang.IllegalStateException: No input
clusters found in output/canopies/clusters-0-final. Check your -c argument.
at
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver.buildClusters(FuzzyKMeansDriver.java:277)
at
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver.run(FuzzyKMeansDriver.java:161)
at
org.apache.mahout.clustering.syntheticcontrol.fuzzykmeans.Job.run(Job.java:138)
at
org.apache.mahout.clustering.syntheticcontrol.fuzzykmeans.Job.main(Job.java:61)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)


Am I missing something

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message