reef-dev mailing list archives

From "Dhruv Mahajan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (REEF-1310) The Java Driver should ACK the Java Evaluator's DONE heartbeat
Date Mon, 04 Apr 2016 17:15:25 GMT

    [ https://issues.apache.org/jira/browse/REEF-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15224541#comment-15224541 ]

Dhruv Mahajan commented on REEF-1310:
-------------------------------------

[~afchung90] I believe this will also solve REEF-1291?

> The Java Driver should ACK the Java Evaluator's DONE heartbeat
> --------------------------------------------------------------
>
>                 Key: REEF-1310
>                 URL: https://issues.apache.org/jira/browse/REEF-1310
>             Project: REEF
>          Issue Type: Bug
>          Components: REEF, REEF Driver, REEF-Common
>            Reporter: Andrew Chung
>
> The Driver should ACK the Evaluator's DONE heartbeat so that a race condition does not
> occur when the Evaluator ends. For example, the Evaluator heartbeats DONE back to the
> Driver, and the RM notices that the Evaluator process has exited. In this case, the RM
> may report to the Driver that the Evaluator is DONE before the Evaluator's DONE heartbeat
> reaches the Driver, causing the Driver to invoke the {{FailedEvaluatorHandler}} because
> of an unexpected DONE message from the RM.
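
The ordering problem described above can be illustrated with a small, self-contained Java sketch. It is not REEF code: the class `DoneAckSketch`, the `DriverView` type, and all method names are hypothetical stand-ins for this illustration, and the issue does not say how the ACK is to be implemented. The sketch assumes the Evaluator process exits only after it has received the ACK, so the RM's exit report cannot reach the Driver before the DONE heartbeat.

```java
/** Hypothetical model of the race in REEF-1310; none of these types exist in REEF. */
final class DoneAckSketch {

  /** The Driver's view of a single Evaluator. */
  enum State { RUNNING, DONE, FAILED }

  /** Hypothetical stand-in for the Driver-side bookkeeping of one Evaluator. */
  static final class DriverView {
    private State state = State.RUNNING;

    /** The Evaluator's DONE heartbeat arrives; the return value models the ACK. */
    boolean onDoneHeartbeat() {
      if (state == State.RUNNING) {
        state = State.DONE; // clean completion
      }
      return true; // ACK sent back to the Evaluator
    }

    /** The RM reports that the Evaluator process has exited. */
    void onResourceManagerExit() {
      if (state == State.RUNNING) {
        // No DONE heartbeat seen yet, so the exit looks unexpected;
        // this is where a failure handler would be invoked.
        state = State.FAILED;
      }
    }

    State state() {
      return state;
    }
  }

  /** Without an ACK the Evaluator exits right after sending, so the RM report can arrive first. */
  static State withoutAck() {
    DriverView driver = new DriverView();
    driver.onResourceManagerExit(); // RM report overtakes the heartbeat
    driver.onDoneHeartbeat();       // heartbeat arrives too late to change the outcome
    return driver.state();
  }

  /** With an ACK (assumed protocol) the Evaluator exits only after the Driver has seen DONE. */
  static State withAck() {
    DriverView driver = new DriverView();
    boolean acked = driver.onDoneHeartbeat();
    if (acked) {
      driver.onResourceManagerExit(); // the exit becomes observable only after the ACK
    }
    return driver.state();
  }

  public static void main(String[] args) {
    System.out.println("without ACK: " + withoutAck());
    System.out.println("with ACK:    " + withAck());
  }
}
```

In this model the only difference between the two paths is the order in which the Driver observes the two events; the ACK is what forces the heartbeat to be seen first.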



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
