Subject: svn commit: r1543870 - /mahout/site/mahout_cms/trunk/content/users/classification/classifyingyourdata.mdtext
Date: Wed, 20 Nov 2013 16:50:29 -0000
To: commits@mahout.apache.org
From: isabel@apache.org
Reply-To: dev@mahout.apache.org
Message-Id: <20131120165029.80418238883D@eris.apache.org>

Author: isabel
Date: Wed Nov 20 16:50:29 2013
New Revision: 1543870

URL: http://svn.apache.org/r1543870
Log:
MAHOUT-1245 - cleaned up classification intro, added some external docs for random forest stuff.

Modified:
    mahout/site/mahout_cms/trunk/content/users/classification/classifyingyourdata.mdtext

Modified: mahout/site/mahout_cms/trunk/content/users/classification/classifyingyourdata.mdtext
URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/trunk/content/users/classification/classifyingyourdata.mdtext?rev=1543870&r1=1543869&r2=1543870&view=diff
==============================================================================
--- mahout/site/mahout_cms/trunk/content/users/classification/classifyingyourdata.mdtext (original)
+++ mahout/site/mahout_cms/trunk/content/users/classification/classifyingyourdata.mdtext Wed Nov 20 16:50:29 2013
@@ -1,47 +1,22 @@
 Title: ClassifyingYourData
-+*Mahout_0.2*+
-After you've done the [Quickstart](quickstart.html)
- and are familiar with the basics of Mahout, it is time to build a
-classifier from your own data.
+# Classifying data from the command line
-The following pieces *may* be useful for in getting started:
+
+After you've done the [Quickstart](../basics/quickstart.html) and are familiar with the basics of Mahout, it is time to build a
+classifier from your own data. The following pieces *may* be useful for in getting started:
 # Input
-For starters, you will need your data in an appropriate Vector format
-(which has changed since Mahout 0.1)
-
-* See [Creating Vectors](creating-vectors.html)
-
-
-## Text Preparation
-
-* See [Creating Vectors from Text](creating-vectors-from-text.html)
-*
-http://www.lucidimagination.com/search/document/4a0e528982b2dac3/document_clustering
+For starters, you will need your data in an appropriate Vector format: See [Creating Vectors](../basics/creating-vectors.html) as well as [Creating Vectors from Text](../basics/creating-vectors-from-text.html).
 # Running the Process
-
-## Naive Bayes
-
-Background: [Naive Bayes Classification ](-bayesian-.html)
-
-Documentation of running naive bayes from the command line: [bayesian-commandline](bayesian-commandline.html)
-
-
-## C-Bayes
-
-Background: [C-Bayes Classification ](-https://issues.apache.org/jira/browse/mahout-60-.html)
-
-Documentation of running c-bayes from the command line: [c-bayes-commandline](c-bayes-commandline.html)
-
-
-## Random Forests
-
-Background: [Random Forests Classification ](-http://cwiki.apache.org/mahout/random-forests.html-.html)
-
-Documentation of running random forests from the command line: [Breiman Example](breiman-example.html)
+* Logistic regression [background](logistic-regression.html)
+* [Naive Bayes background](naivebayes.html) and [commandline](bayesian-commandline.html) options.
+* [Complementary naive bayes background](complementary-naive-bayes.html), [design](https://issues.apache.org/jira/browse/mahout-60.html), and [c-bayes-commandline](c-bayes-commandline.html)
+* [Random Forests Classification](https://cwiki.apache.org/confluence/display/MAHOUT/Random+Forests) comes with a [Breiman example](breiman-example.html). There is some really great documentation
+over at [Mark Needham's blog](http://www.markhneedham.com/blog/2012/10/27/kaggle-digit-recognizer-mahout-random-forest-attempt/). Also checkout the description on [Xiaomeng Shawn Wan
+s](http://shawnwan.wordpress.com/2012/06/01/mahout-0-7-random-forest-examples/) blog.
\ No newline at end of file