From: Yexi Jiang
To: mahout
Date: Mon, 24 Mar 2014 16:04:49 -0400
Subject: Re: [jira] [Assigned] (MAHOUT-1388) Add command line support and logging for MLP

The patch is already available.

2014-03-23 1:01 GMT-04:00 Suneel Marthi (JIRA):

> [ https://issues.apache.org/jira/browse/MAHOUT-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
>
> Suneel Marthi reassigned MAHOUT-1388:
> -------------------------------------
>
>     Assignee: Suneel Marthi
>
> > Add command line support and logging for MLP
> > --------------------------------------------
> >
> >              Key: MAHOUT-1388
> >              URL: https://issues.apache.org/jira/browse/MAHOUT-1388
> >          Project: Mahout
> >       Issue Type: Improvement
> >       Components: Classification
> > Affects Versions: 1.0
> >         Reporter: Yexi Jiang
> >         Assignee: Suneel Marthi
> >           Labels: mlp, sgd
> >          Fix For: 1.0
> >
> >      Attachments: Mahout-1388.patch, Mahout-1388.patch
> >
> >
> > The user should have the ability to run the Perceptron from the command
> > line.
> > There are two programs to execute MLP: training and labeling. The first
> > takes the data as input and outputs the model; the second takes the model
> > and unlabeled data as input and outputs the results.
> > The parameters for training are as follows:
> > ------------------------------------------------
> > --input -i (input data)
> > --skipHeader -sk // whether to skip the first row; this parameter is
> >     optional
> > --labels -labels // the labels of the instances, separated by whitespace.
> >     Take the iris dataset for example: the labels are
> >     'setosa versicolor virginica'.
> > --model -mo // in training mode, this is the location to store the model
> >     (if the specified location has an existing model, it will be updated
> >     through incremental learning); in labeling mode, this is the location
> >     to store the result
> > --update -u // whether to incrementally update the model; if this
> >     parameter is not given, the model is trained from scratch
> > --output -o // this is only useful in labeling mode
> > --layersize -ls (no. of units per hidden layer) // use whitespace-separated
> >     numbers to indicate the number of neurons in each layer (including
> >     input layer and output layer), e.g. '5 3 2'.
> > --squashingFunction -sf // currently only supports Sigmoid
> > --momentum -m
> > --learningrate -l
> > --regularizationweight -r
> > --costfunction -cf // the type of cost function,
> > ------------------------------------------------
> > For example, to train a 3-layer (including input, hidden, and output) MLP
> > with a 0.1 learning rate, 0.1 momentum rate, and 0.01 regularization
> > weight, the parameters would be:
> > mlp -i /tmp/training-data.csv -labels setosa versicolor virginica -o /tmp/model.model -ls 5,3,1 -l 0.1 -m 0.1 -r 0.01
> > This command would read the training data from /tmp/training-data.csv
> > and write the trained model to /tmp/model.model.
> > The parameters for labeling are as follows:
> > -------------------------------------------------------------
> > --input -i // input file path
> > --columnRange -cr // the range of columns used as features, starting
> >     from 0 and separated by whitespace, e.g.
> >     0 5
> > --format -f // the format of the input file, currently only supports csv
> > --model -mo // the file path of the model
> > --output -o // the output path for the results
> > -------------------------------------------------------------
> > If a user needs to use an existing model, they would use the following
> > command:
> > mlp -i /tmp/unlabel-data.csv -m /tmp/model.model -o /tmp/label-result
> > Moreover, we should provide default values if the user does not specify
> > any.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>

--
------
Yexi Jiang,
ECS 251, yjian004@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/
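
For readers who want to experiment with the training options listed in the quoted issue, here is a minimal sketch of how they could be parsed. It is written in Python with argparse purely for illustration and is not the attached Mahout-1388.patch or Mahout's actual command-line code. The function name `build_training_parser` is hypothetical, and the default values are placeholders borrowed from the example command, since the issue only says that defaults should exist. The sketch follows the option list (model path via `-mo`, whitespace-separated layer sizes), which differs from the example command's `-o` and `-ls 5,3,1`.

```python
import argparse


def build_training_parser():
    """Hypothetical parser mirroring the training flags listed in the issue."""
    parser = argparse.ArgumentParser(prog="mlp")
    parser.add_argument("--input", "-i", required=True, help="input data")
    # Optional flag: skip the first row of the input.
    parser.add_argument("--skipHeader", "-sk", action="store_true")
    # Whitespace-separated class labels, e.g. setosa versicolor virginica.
    parser.add_argument("--labels", "-labels", nargs="+", required=True)
    # Where the model is stored (or an existing model to update).
    parser.add_argument("--model", "-mo", required=True)
    # When absent, the model is trained from scratch.
    parser.add_argument("--update", "-u", action="store_true")
    # Neurons per layer, including input and output layers, e.g. 5 3 2.
    parser.add_argument("--layersize", "-ls", nargs="+", type=int, required=True)
    # The issue says only Sigmoid is supported for now.
    parser.add_argument("--squashingFunction", "-sf",
                        choices=["Sigmoid"], default="Sigmoid")
    # Placeholder defaults (assumption): taken from the example command.
    parser.add_argument("--momentum", "-m", type=float, default=0.1)
    parser.add_argument("--learningrate", "-l", type=float, default=0.1)
    parser.add_argument("--regularizationweight", "-r", type=float, default=0.01)
    # The issue does not list the allowed cost functions, so none are assumed.
    parser.add_argument("--costfunction", "-cf", default=None)
    return parser


if __name__ == "__main__":
    args = build_training_parser().parse_args([
        "-i", "/tmp/training-data.csv",
        "-labels", "setosa", "versicolor", "virginica",
        "-mo", "/tmp/model.model",
        "-ls", "5", "3", "1",
        "-l", "0.1", "-m", "0.1", "-r", "0.01",
    ])
    print(args)
```

Using `nargs="+"` lets `-labels` and `-ls` take whitespace-separated values as the option descriptions specify; each list ends at the next recognized flag.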