incubator-hama-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hama Wiki] Update of "GroomServerFaultTolerance" by ChiaHungLin
Date Sun, 03 Apr 2011 12:19:00 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hama Wiki" for change notification.

The "GroomServerFaultTolerance" page has been changed by ChiaHungLin.
http://wiki.apache.org/hama/GroomServerFaultTolerance?action=diff&rev1=1&rev2=2

--------------------------------------------------

- = GroomServerFaultTolerance(Draft) =
+ == GroomServerFaultTolerance (Draft) ==
+ 
+ === Introduction ===
+ 
+ Distributed computing system such as Hadoop[1], and Dryad[2] provide fault tolerance feature
to help the system survive over the process crash. It is particular useful when computation
requires to finish its execution in long time. Hama, based on the BSP[3] model, is a framework
for massive scientific computations, which also requires this feature so that developers and
users who exploit this framework can benefit from it. This page serves for providing information
on direction how Hama GroomServer fault tolerance would work. 
+ 
+ === Literature Review ===
  
  
  
+ === Architecture ===
- Many Other 
- 
- '''Literature Review'''
- 
- '''Architecture'''
  
  
  
+ === Glossary ===
+ 
+ NodeManager
+ 
+ Failure Detector
+ 
+ Supervisor behaviour
+ 
+ === References ===
+ [1]. Hadoop. http://hadoop.apache.org/
+ 
+ [2]. Dryad: distributed data-parallel programs from sequential building blocks. http://portal.acm.org/citation.cfm?id=1273005
+ 
+ [3]. Bulk Synchronous Parallel Computing -- A Paradigm for Transportable Software. http://portal.acm.org/citation.cfm?id=798134
+ 

Mime
View raw message