incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Incubator Wiki] Update of "HamaProposal" by udanax
Date Wed, 05 Mar 2008 23:32:18 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/incubator/HamaProposal

New page:
== Abstract ==
Hama will develop a parallel matrix computational package based on [http://hadoop.apache.org
Hadoop] Map/Reduce.
== Proposal ==
Hama will develop a high-performance parallel matrix computational package based on Hadoop
Map/Reduce. It will be useful for a massively large-scale ''Numerical Analysis'' and ''Data
Mining'', which need the intensive computation power of matrix inversion, e.g. linear regression,
PCA, SVM and etc. It will be also useful for many scientific applications, e.g. physics computations,
linear algebra, computational fluid dynamics, statistics, graphic rendering and many more.

== Background ==
Currently, several shared-memory based parallel matrix solutions can provide a scalable and
high performance matrix operations, but matrix resources can not be scalable in the term of
complexity.  
== Rationale ==

Hama approach proposes the use of 2-dimensional Row and Column (Qualifier) space and multi-dimensional
Columnfamilies of [http://hadoop.apache.org/hbase Hbase], which is able to store large sparse
and various type of matrices (e.g. Triangular Matrix, 3D Matrix, and etc.). its auto-partitioned
sparsity sub-structure will be efficiently managed and serviced by Hbase. Row and Column operations
can be done in linear-time, where several algorithms, such as ''structured Gaussian elimination''
or ''iterative methods'', run in O(the number of non-zero elements in the matrix/ number of
mappers) time on Hadoop Map/Reduce.

== Current Status ==

Currently, generalized matrix interface and basic linear algebra operations was implemented
within a large prototype system.

== Meritocracy ==

The initial developers are very familiar with meritocratic open source development, both at
Apache and elsewhere. Apache was chosen specifically because the initial developers want to
encourage this style of development for the project.

=== Community ===

Hama seeks to develop developer and user communities during incubation. 

== Core Developers ==

The initial set of committers includes folks from the [http://hadoop.apache.org Hadoop] &
[http://hadoop.apache.org/hbase Hbase] community. We have varying degrees of experience with
Apache-style open source development, ranging from none to ASF Members. 

== Alignment ==

The developers of Hama want to work with the Apache Software Foundation specifically because
Apache has proven to provide a strong foundation and set of practices for developing standards-based
infrastructure and server components. 

== Known Risks ==
=== Orphaned products ===
Due to its small number of committers, there is a risk of being orphaned.
=== Inexperience with Open Source===
We has already a good experience with Apache open source development process.

=== Homogenous Developers ===
With only two core developers, at least they are not homogenous, Edward and Chanwit knew each
other only due to their common interest in Hama.
=== Reliance on Salaried Developers ===
Edward is a full-time open source developer at NHN, and Chanwit is a Ph.D student in computer
science. 
=== Relationships with Other Apache Products ===
Hama has a strong relationship with Apache [http://hadoop.apache.org Hadoop] & [http://hadoop.apache.org/hbase
Hbase]. Being part of Apache could help for a closer collaboration between the three projects.

=== A Excessive Fascination with the Apache Brand ===

We believe in the processes, systems, and framework Apache has put in place. The brand is
nice, but is not why we wish to come to Apache.

== Documentation ==

 * http://code.google.com/p/hama/w/list


== Initial Source ==
 * http://code.google.com/p/hama/source/checkout

== External Dependencies ==
 * Hadoop (HDFS, Map/Reduce) License: Apache License, 2.0
 * Hbase (Sparse Matrix Table) License: Apache License, 2.0

== Required Resources ==

 * Developer and user mailing lists
  * hama-private@incubator.apache.org
  * hama-dev@incubator.apache.org
  * hama-user@incubator.apache.org
 * A subversion repository
  *  https://svn.apache.org/repos/asf/incubator/hama
 * A JIRA issue tracker 

== Initial Committers ==
 * Edward Yoon (edward AT udanax DOT org)
 * Chanwit Kaewkasi (chanwit AT gmail DOT com)

== Sponsors ==
=== Nominated Mentors ===
In need of mentors to volunteer.
=== Sponsoring Entity ===
The Apache Incubator. 

---------------------------------------------------------------------
To unsubscribe, e-mail: cvs-unsubscribe@incubator.apache.org
For additional commands, e-mail: cvs-help@incubator.apache.org


Mime
View raw message