mahout-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From conflue...@apache.org
Subject [CONF] Apache Lucene Mahout: ClusteringYourData (page edited)
Date Wed, 17 Jun 2009 13:11:02 GMT
ClusteringYourData (MAHOUT) edited by Grant Ingersoll
      Page: http://cwiki.apache.org/confluence/display/MAHOUT/ClusteringYourData
   Changes: http://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=120583&originalVersion=2&revisedVersion=3






Content:
---------------------------------------------------------------------

+*Mahout_0.2*+

After you've done the [QuickStart] and are familiar with the basics of Mahout, it is time
to cluster your own data. 

The following pieces *may* be useful for in getting started:

h1. Input

For starters, you will need your data in an appropriate Vector format (which has changed since
Mahout 0.1)

h2. Text Preparation

* See [Creating Vectors from Text] 
* http://www.lucidimagination.com/search/document/4a0e528982b2dac3/document_clustering

h1. Running the Process

+*TODO*+ FILL ME IN
h2. Canopy

Background: [canopy | Canopy Clustering]

h2. kMeans

Background: [k-means]

h2. Dirichlet

Background: [dirichlet | Dirichlet Process Clustering]

h2. Mean-shift

Background:  [meanshift | Mean Shift]
h1. Validating the Output


* See http://www.lucidimagination.com/search/document/dab8c1f3c3addcfe/validating_clustering_output

h1. References

* [Mahout archive references|http://www.lucidimagination.com/search/p:mahout?q=clustering]

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

If you want more information on Confluence, or have a bug to report see
   http://www.atlassian.com/software/confluence



Mime
View raw message