From: "Hudson (JIRA)"
To: dev@mahout.apache.org
Date: Fri, 24 Sep 2010 09:59:33 -0400 (EDT)
Subject: [jira] Commented: (MAHOUT-502) Adding footer note to command line utility

[ https://issues.apache.org/jira/browse/MAHOUT-502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914468#action_12914468 ]

Hudson commented on MAHOUT-502:
-------------------------------

Integrated in
Mahout-Quality #320 (See [https://hudson.apache.org/hudson/job/Mahout-Quality/320/])
    MAHOUT-502

> Adding footer note to command line utility
> ------------------------------------------
>
>                 Key: MAHOUT-502
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-502
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Utils
>            Reporter: Joe Prasanna Kumar
>            Priority: Trivial
>             Fix For: 0.4
>
>         Attachments: MAHOUT-502.patch
>
>
> Hi all,
> Since ClusterDumper doesn't seem to have detailed documentation, I just created a page: https://cwiki.apache.org/confluence/display/MAHOUT/Cluster+Dumper
> While playing around with the clusterdump utility, I learned that it can be run on Hadoop or as a standalone Java program.
> As most of you are aware, when executed on Hadoop, seqFileDir and pointsDir should be HDFS locations; otherwise they should be local filesystem paths. Since some of the clustering-related wiki pages say that we can get the output from HDFS and then run clusterdump, I was assuming that clusterdump would always read data from the local FS.
> I am not sure if newbies would have the same thought process. So I was wondering whether we need to make this explicit by changing the help text of clusterdump.
> Currently ClusterDumper.java has
> addOption(SEQ_FILE_DIR_OPTION, "s", "The directory containing Sequence Files for the Clusters", true);
> Should we specify something like
> addOption(SEQ_FILE_DIR_OPTION, "s", "The directory (HDFS if using Hadoop / Local filesystem if on standalone mode) containing Sequence Files for the Clusters", true);
> and so on?
> The problem with this approach is that it's repetitive: we'd need to change it in quite a few places. (I believe vectordump also follows the same principle.)
> or
> should we modify CommandLineUtil to have a generic message in the help stating that when running on Hadoop, the directories should reference HDFS locations, and otherwise the local FS?
> How about adding it to the footer, like
> formatter.setFooter("Specify HDFS directories while running hadoop; else specify local File System directories");
> formatter.printFooter();
> Appreciate your feedback / thoughts.
> thanks
> Joe.
> from Jeff Eastman
> reply-to dev@mahout.apache.org
> to dev@mahout.apache.org
> date Fri, Sep 3, 2010 at 2:45 PM
> subject Re: ClusterDumper - Hadoop or standalone ?
> +1 to generic message approach

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
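
For illustration, the generic-footer approach favoured in the quoted thread could look roughly like the sketch below. It deliberately avoids the real CLI library so that it stays self-contained: `HelpFooterSketch` and `renderHelp` are hypothetical names, not Mahout's `CommandLineUtil`, and the `pointsDir` description is placeholder text. Only the footer string and the `seqFileDir` description are taken from the issue.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for a shared help printer; not Mahout's CommandLineUtil.
class HelpFooterSketch {

    // Footer text as proposed in the issue.
    static final String FS_FOOTER =
        "Specify HDFS directories while running hadoop; else specify local File System directories";

    // Renders every option once, then appends a single shared footer,
    // so individual option descriptions need not repeat the HDFS/local note.
    static String renderHelp(String command, Map<String, String> options, String footer) {
        StringBuilder sb = new StringBuilder();
        sb.append("Usage: ").append(command).append(" [options]\n");
        for (Map.Entry<String, String> e : options.entrySet()) {
            sb.append("  --").append(e.getKey()).append("  ").append(e.getValue()).append('\n');
        }
        sb.append('\n').append(footer).append('\n');
        return sb.toString();
    }

    public static void main(String[] args) {
        Map<String, String> options = new LinkedHashMap<>();
        options.put("seqFileDir", "The directory containing Sequence Files for the Clusters");
        // Placeholder description; the issue does not quote the real one.
        options.put("pointsDir", "(placeholder) The directory containing the clustered points");
        System.out.print(renderHelp("clusterdump", options, FS_FOOTER));
    }
}
```

Because the footer is passed in once at the point where help is rendered, utilities such as clusterdump and vectordump would share the same note without each option description being edited.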