hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "ZooKeeper/GSoCFailureDetector" by AbmarBarros
Date Wed, 05 May 2010 14:09:27 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "ZooKeeper/GSoCFailureDetector" page has been changed by AbmarBarros.


New page:
== GSoC 2010: ZooKeeper Failure Detector Model ==

== Abstract ==

ZooKeeper servers detect the failure of other servers and clients by counting the number of
'ticks' for which they do not receive a heartbeat from those machines. This is the 'timeout'
method, and it works very well; however, it may be too aggressive and is not easily tuned for
some of the more unusual ZooKeeper installations. This project's goals are to abstract the
failure detector into a separate module, to implement several failure detectors, and to compare
their appropriateness for ZooKeeper.
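
As a rough illustration of the 'timeout' method and of what a separate failure detector module might look like, here is a minimal sketch in Java. The names `FailureDetector`, `FixedTimeoutDetector`, `heartbeat`, `isSuspected` and `timeoutTicks` are hypothetical and are not taken from the ZooKeeper code base; the sketch also assumes that a peer which has never sent a heartbeat counts as suspected.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical abstraction, NOT ZooKeeper's actual API: other detection
// methods could implement this same interface.
interface FailureDetector {
    // Record that a heartbeat from peerId arrived at the given tick.
    void heartbeat(String peerId, long tick);

    // Report whether peerId should be considered failed at currentTick.
    boolean isSuspected(String peerId, long currentTick);
}

// The 'timeout' method: suspect a peer once more than timeoutTicks ticks
// have passed since its last heartbeat.
class FixedTimeoutDetector implements FailureDetector {
    private final long timeoutTicks;
    private final Map<String, Long> lastHeartbeat = new HashMap<>();

    FixedTimeoutDetector(long timeoutTicks) {
        if (timeoutTicks <= 0) {
            throw new IllegalArgumentException("timeoutTicks must be positive");
        }
        this.timeoutTicks = timeoutTicks;
    }

    @Override
    public void heartbeat(String peerId, long tick) {
        // Keep the most recent tick, even if heartbeats arrive out of order.
        lastHeartbeat.merge(peerId, tick, Long::max);
    }

    @Override
    public boolean isSuspected(String peerId, long currentTick) {
        Long last = lastHeartbeat.get(peerId);
        if (last == null) {
            return true; // assumption: a peer never heard from is suspected
        }
        return currentTick - last > timeoutTicks;
    }
}

class FailureDetectorDemo {
    public static void main(String[] args) {
        FailureDetector detector = new FixedTimeoutDetector(3);
        detector.heartbeat("server-2", 10);
        System.out.println(detector.isSuspected("server-2", 12)); // false: 2 silent ticks
        System.out.println(detector.isSuspected("server-2", 14)); // true: 4 silent ticks
    }
}
```

The single fixed `timeoutTicks` value is what makes this method simple but hard to tune: every peer is judged against the same threshold regardless of how its link behaves. Keeping the rest of the server dependent only on the interface would allow alternative detectors to be swapped in and compared.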

== Roadmap ==

 1. Discuss the project with the community (dev/user lists), ask for suggestions and requirements,
and decide which types and methods are to be implemented (Community Bonding Period)
 1. Study the specifications of the chosen failure detection methods and the ZooKeeper code (24th
 1. Isolate the failure detector model in the ZooKeeper code (14th June) 
 1. Implement the chosen failure detector methods (28th June)
 1. Evaluate the QoS metrics for the implemented methods (26th July)

== Related JIRA ==
 * [[https://issues.apache.org/jira/browse/ZOOKEEPER-702|ZOOKEEPER-702]]
