reef-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Weimer (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (REEF-1310) The Java Driver should ACK the Java Evaluator's DONE heartbeat
Date Thu, 14 Apr 2016 21:29:25 GMT

     [ https://issues.apache.org/jira/browse/REEF-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Markus Weimer resolved REEF-1310.
---------------------------------
       Resolution: Fixed
    Fix Version/s: 0.15

Resolved via [#947|https://github.com/apache/reef/pull/947]

> The Java Driver should ACK the Java Evaluator's DONE heartbeat
> --------------------------------------------------------------
>
>                 Key: REEF-1310
>                 URL: https://issues.apache.org/jira/browse/REEF-1310
>             Project: REEF
>          Issue Type: Bug
>          Components: REEF, REEF Driver, REEF-Common
>            Reporter: Andrew Chung
>            Assignee: Andrew Chung
>             Fix For: 0.15
>
>
> The Driver should ACK the Evaluator's DONE heartbeat such that a race condition does
not occur when the Evaluator ends. *i.e.* The Evaluator heartbeats DONE back to the Driver
and the RM notices that the Evaluator process has exited. In this case, it is possible that
the RM reports back to the Driver that the Evaluator is DONE before the Evaluator's DONE heartbeat
goes back to the Driver, causing the Driver to invoke the {{FailedEvaluatorHandler}} due to
an unexpected DONE message from the RM.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message