hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Trivial Update of "DataProcessingBenchmarks" by udanax
Date Fri, 16 Jan 2009 00:58:59 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/hadoop/DataProcessingBenchmarks

------------------------------------------------------------------------------
  ||<bgcolor="#E5E5E5">Time  ||4.43 sec ||172.30 sec||108.01 sec||77.41 sec||66.30 sec||60.78 sec||
  
  ----
- == Hbase Matrix computations Benchmarks ==
-  * You can download the Hbase [http://incubator.apache.org/hama Matrix Package] for Map/Reduce-based Parallel Matrix Computations (still under development)
  
- ==== MapReduce Flow ====
+ I also investigated many traditional methods of parallel processing and experimented with some high-level processing (e.g. matrix algebra, graph algorithms) using Hadoop/Hbase/MapReduce. The only way to increase speed linearly was locality (write all the data, even at the cost of duplicated effort). As the number of nodes increases, the number of IO channels increases linearly.
  
-  * The multiplication requires (n + 1) full table scans, irrespective of the number of mappers.

-  * Each map processor requires O(n^2) for communication and O(n^3/mappers) for computation.

- 
- ----
- 
