Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The following page has been changed by udanax:
http://wiki.apache.org/hadoop/Matrix
The comment on the change is:
Move to 'http://wiki.apache.org/hadoop/Hama'

+ deleted
 == Matrix Package ==
 The Matrix Package for Map/Reducebased Parallel Matrix Computations will be useful for
LargeScale Numeric Analysis and Data Mining which need the computation system of the Inverse
Matrix (e.g. linear regression, PCA, SVM, ..., etc) and many scientific computations (e.g.
physics, linear algebra, computational fluid dynamics, statistics, graphics rendering and
others).
 Generally, The current sharedmemory based parallel matrix solution provides a scalable
and high performance matrix operations, but, matrix resources can't be scalable. But, Using
Hbase's 2 dimensional Row and Column(Qualifier) space, we are able to store large sparse
matrix. Also, autopartitioned sparsity substructure will be efficiently managed and serviced
by Hbase. Row or Column operations can be done in linear time and algorithms such as structured
Gaussian elimination or iterative methods run in O(~the number of nonzero elements in the
matrix~ / ~number of mappers (processors/cores)~) time on Hadoop Map/Reduce.

 === Initial Contributors ===

 * [:udanax:Edward Yoon] (R&D center, NHN corp.)
 * Chanwit Kaewkasi

 === Project Homepage ===

 * http://code.google.com/p/hama

 === Dependencies ===

 * Hadoop (HDFS, Map/Reduce) License: Apache License, 2.0
 * Hbase (Sparse Matrix Table) License: Apache License, 2.0
 * Groovy (Agile Dynamic Language) License: Apache License, 2.0

 === RoadMap ===
 ==== Version 0.1 ====
 * Done by Edward Yoon
 * Initial version imported here
 ==== Version 0.2 ====
 * Expected by April 15, 2008 Qdg parser (JSR241 implementation)
 * Matrix / Vector literals
 
 == References ==

 * [http://bebop.cs.berkeley.edu/oski/ OSKI], optimized sparse kernel interface (OSKI) library
 * [http://icl.cs.utk.edu/iclprojects/pages/files/sans/yelickbebop.pdf Automatic Performance
Tuning of Sparse Matrix Kernels], August 7, 2002
 * Parallel Conjugate Gradients Assignment, a parallel implementation of the conjugate gradient
algorithm
 * ScaLAPACK, a library of highperformance linear algebra routines for distributedmemory
messagepassing MIMD computers
 * Scheduling algorithms for parallel Gaussian elimination withcommunication costs, Amoura,
A.K.; Bampis, E.; Konig, J.C.
 * High performance numerical libraries in Java, BjørnOve Heimsund

