mahout-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From conflue...@apache.org
Subject [CONF] Apache Lucene Mahout: Matrix and Vector Needs (page edited)
Date Fri, 22 Feb 2008 11:37:00 GMT
Matrix and Vector Needs (MAHOUT) edited by Grant Ingersoll
      Page: http://cwiki.apache.org/confluence/display/MAHOUT/Matrix+and+Vector+Needs
   Changes: http://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=75990&originalVersion=3&revisedVersion=4






Content:
---------------------------------------------------------------------

h1. Intro

We need some basic Matrix operations to start, and will definitely need more as we go.


h2. Background

See [http://mail-archives.apache.org/mod_mbox/lucene-mahout-dev/200802.mbox/browser]

h2. Needs

We will more than likely need all the basic Matrix operations, plus some more advanced ones:

* Addition, Subtraction, Multiplication, Transpose, Inverse, Scaling


For ideas like PageRank/TextRank, iterative approaches that, essentially, calculate eigenvectors
are also useful.

Similarly, for vectors, things like dot/cross product will be useful

h2. Ideas

Use HBase (BigTable) in Hadoop to represent the Matrix.  Batching row/column operations can
be useful.

See [MAHOUT-6|ttps://issues.apache.org/jira/browse/MAHOUT-6]


h2. References

Have a look at the old parallel computing libraries like [ScalaPACK|http://www.netlib.org/scalapack/],
others

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

If you want more information on Confluence, or have a bug to report see
   http://www.atlassian.com/software/confluence



Mime
View raw message