mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paritosh Ranjan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-1108) cluster-reuters.sh executes seqdirectory with MAHOUT_LOCAL=true
Date Thu, 01 Nov 2012 11:21:13 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488613#comment-13488613
] 

Paritosh Ranjan commented on MAHOUT-1108:
-----------------------------------------

I tend to agree with you.

I don't see the point of extracting it locally and the putting the files in hdfs. From what
I can see, nothing else is done sequentially (locally) in this script. So, MAHOUT_LOCAL seems
to be redundant to me.

Still, I think that the first mapreduce call be after 

 HADOOP="$HADOOP_HOME/bin/hadoop"
  if [ ! -e $HADOOP ]; then
    echo "Can't find hadoop in $HADOOP, exiting"
    exit 1
  fi

so that the user is warned with a proper message.

Since I am not the creator of this script, and I am not sure about the use of MAHOUT_LOCAL,
I would like to wait for someone to clarify the doubts regarding MAHOUT_LOCAL. Then, I think
we can go ahead this change with some modifications ( like putting the mapreduce call after
the check of hadoop's existence).


                
> cluster-reuters.sh executes seqdirectory with MAHOUT_LOCAL=true
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-1108
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1108
>             Project: Mahout
>          Issue Type: Bug
>    Affects Versions: 0.7
>            Reporter: Elmer Garduno
>            Priority: Minor
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Got the following exception when running the command with HADOOP_CONF and  HADOOP_CONF_DIR
> Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/util/ProgramDriver
> 	at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:96)
> Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.util.ProgramDriver
> 	at java.net.URLClassLoader$1.run(Unknown Source)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at java.net.URLClassLoader.findClass(Unknown Source)
> 	at java.lang.ClassLoader.loadClass(Unknown Source)
> 	at sun.misc.Launcher$AppClassLoader.loadClass(Unknown Source)
> 	at java.lang.ClassLoader.loadClass(Unknown Source)
> 	... 1 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message