incubator-hama-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Edward J. Yoon (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HAMA-370) Failure detector for Hama
Date Mon, 21 Mar 2011 14:07:05 GMT

    [ https://issues.apache.org/jira/browse/HAMA-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13009128#comment-13009128
] 

Edward J. Yoon commented on HAMA-370:
-------------------------------------

Just curious, what's the benefits of using a phi accrual detection compared w/ heartbeat detection?
(GroomServer Failure)

And, when BSP task failed during processing, we can simply re-start the task to provides fault
tolerance.

> Failure detector for Hama
> -------------------------
>
>                 Key: HAMA-370
>                 URL: https://issues.apache.org/jira/browse/HAMA-370
>             Project: Hama
>          Issue Type: New Feature
>          Components: bsp
>    Affects Versions: 0.3.0
>         Environment: GNU/ Debian, JDK 1.6.0_22-b04 
>            Reporter: ChiaHung Lin
>            Assignee: ChiaHung Lin
>              Labels: patch
>             Fix For: 0.3.0
>
>         Attachments: HAMA-370.patch, HAMA-370.patch
>
>
> In order to enable fault tolerance service, BSPMaster requires to have ability in determining
GroomServers' status. This generally can be achieved through failure detector. The attached
file contains source for such patch. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message