hama-dev mailing list archives

From "Edward J. Yoon (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HAMA-974) Support fault tolerance for Graph job
Date Tue, 08 Sep 2015 03:31:45 GMT

    [ https://issues.apache.org/jira/browse/HAMA-974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734184#comment-14734184 ]

Edward J. Yoon commented on HAMA-974:
-------------------------------------

Isn't HAMA-881 already addressed? I tested AsyncRcvdMsgCheckpointImpl, and it works fine.

The problem is the state of variables as of the last checkpoint. I think providing a custom
checkpoint function is the best approach. For example, we could add a checkpointState() method
to BSPInterface.

{code}
public void setup() { }

public void bsp() {
  // your program
}

// proposed hook, does not exist yet
public void checkpointState() {
  // define variables to be checkpointed
}

public void close() { }
{code}
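
To make the checkpointState() idea above a bit more concrete, here is a minimal standalone sketch. It does not use Hama's API: the Checkpointable interface, the restoreState() counterpart, CounterTask and CheckpointDemo are all hypothetical names invented for illustration, and the in-memory byte array stands in for wherever the framework would actually persist the checkpoint.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical hook, NOT part of Hama's BSPInterface: the user program
// decides which variables are written to and read back from a checkpoint.
interface Checkpointable {
  void checkpointState(DataOutputStream out) throws IOException;

  // Assumed counterpart used on recovery; not mentioned in the proposal.
  void restoreState(DataInputStream in) throws IOException;
}

// Toy task with two variables that must survive a restart.
class CounterTask implements Checkpointable {
  long superstep;
  double aggregate;

  // Stands in for one superstep of the user program.
  void step(double value) {
    superstep++;
    aggregate += value;
  }

  @Override
  public void checkpointState(DataOutputStream out) throws IOException {
    out.writeLong(superstep);
    out.writeDouble(aggregate);
  }

  @Override
  public void restoreState(DataInputStream in) throws IOException {
    // Fields must be read in the same order they were written.
    superstep = in.readLong();
    aggregate = in.readDouble();
  }
}

// Stand-in for the framework side: serializes the state to a byte array.
class CheckpointDemo {
  static byte[] save(Checkpointable task) {
    try {
      ByteArrayOutputStream buffer = new ByteArrayOutputStream();
      DataOutputStream out = new DataOutputStream(buffer);
      task.checkpointState(out);
      out.flush();
      return buffer.toByteArray();
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  static void load(Checkpointable task, byte[] snapshot) {
    try {
      task.restoreState(new DataInputStream(new ByteArrayInputStream(snapshot)));
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }
}
```

In this sketch the framework would call save() alongside the existing message checkpoint and load() when a failed task is restarted, so the user variables and the checkpointed messages refer to the same superstep.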

> Support fault tolerance for Graph job
> -------------------------------------
>
>                 Key: HAMA-974
>                 URL: https://issues.apache.org/jira/browse/HAMA-974
>             Project: Hama
>          Issue Type: Improvement
>          Components: graph
>    Affects Versions: 0.7.0
>            Reporter: Edward J. Yoon
>             Fix For: 0.8.0
>
>
> Currently we only checkpoint messages. To support FT for graph jobs, aggregators, assigned
> vertices, and their statuses must be checkpointed together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
