mahout-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From conflue...@apache.org
Subject [CONF] Apache Lucene Mahout: index (page edited)
Date Wed, 28 May 2008 19:24:00 GMT
index (MAHOUT) edited by Isabel Drost
      Page: http://cwiki.apache.org/confluence/display/MAHOUT/index
   Changes: http://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=74539&originalVersion=26&revisedVersion=27






Content:
---------------------------------------------------------------------

h1. Apache Mahout Wiki

Apache Mahout is a new Lucene TLP project to create scalable, machine learning algorithms
under the Apache license. For more information on the project goals please see the [original
proposal|http://ml-site.grantingersoll.com/index.php?title=Incubator_proposal].

{toc:style=disc|minlevel=2}

h2. Historical Information

Project inspiration and formulation can be found at [http://ml-site.grantingersoll.com]

h2. General

[TODO]

[FAQ]

[HowToContribute]

[HowToBecomeACommitter]

[Hadoop|http://hadoop.apache.org]

h2. Design

[Collection(De-)Serialization]

[Matrix and Vector Needs]

h2. Algorithms

This section contains links to information, examples, use cases, etc. for the various algorithms
we intend to implement.  Click the individual links to learn more. The initial algorithms
descriptions have been copied here from the original project proposal. The algorithms are
grouped by the application setting, they can be used for. In case of multiple applications,
the version presented in the paper was chosen, versions as implemented in our project will
be added as soon as we are working on them.

Original Paper: [Map Reduce for Machine Learning on Multicore|http://www.cs.stanford.edu/people/ang//papers/nips06-mapreducemulticore.pdf]

Papers related to Map Reduce:
   * [Evaluating MapReduce for Multi-core and Multiprocessor Systems|http://csl.stanford.edu/~christos/publications/2007.cmp_mapreduce.hpca.pdf]

h3. Classification

A general introduction to the most common text classification algorithms can be found at Google
Answers: http://answers.google.com/answers/main?cmd=threadview&id=225316 For information
on the algorithms implemented in Mahout (or scheduled for implementation) please visit the
following pages.

[Logistic Regression]

[NaiveBayes]

[Support Vector Machines] (SVM)

[Neural Network]

h3. Clustering

[Canopy Clustering]

[k-Means]

[Expectation Maximization] (EM)

[Mean Shift]

h3. Regression

[Locally Weighted Linear Regression]

h3. Dimension reduction

[Principal Components Analysis ] (PCA)

[Independent Component Analysis]

[Gaussian Discriminative Analysis] (GDA)

h3. Non map reduce algorithms

Some algorithms and applications appeared on the mailing list, that have not been published
in map reduce form so far. As we do not restrict ourselves to hadoop-only versions, these
proposals are listed here.

[Hidden Markov Models] (HMM)

[Recommendation Learning]

h2. Data

[Collections]


h2. Community

[MailingListArchives]
[PoweredBy]
[IssueTracker]

h2. Committer's Resources

[HowToUpdateTheWebsite]

[PatchCheckList]

[ReleaseToDo]

[Apache Machine Status|http://monitoring.apache.org/status/] -- Check to see if SVN, other
resources are available

h3. Other Resources

[Committer's FAQ|http://www.apache.org/dev/committers.html]

[Apache Dev|http://www.apache.org/dev/]

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

If you want more information on Confluence, or have a bug to report see
   http://www.atlassian.com/software/confluence



Mime
View raw message