reef-dev mailing list archives

From "Andrey (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (REEF-1471) IMRUDriver.FailAction does not report job failure
Date Thu, 30 Jun 2016 04:49:10 GMT

    [ https://issues.apache.org/jira/browse/REEF-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356512#comment-15356512 ]

Andrey commented on REEF-1471:
------------------------------

Unless it's a small fix, I'd suggest keeping it separate.
The 1251 PR has been cooking for a long time already and desperately needs to be checked in.
It's also easy to isolate the concerns: reporting job errors to the RM vs. handling recoverable
failures.

> IMRUDriver.FailAction does not report job failure
> -------------------------------------------------
>
>                 Key: REEF-1471
>                 URL: https://issues.apache.org/jira/browse/REEF-1471
>             Project: REEF
>          Issue Type: Bug
>          Components: IMRU
>            Reporter: Andrey
>            Assignee: Sergiy Matusevych
>            Priority: Critical
>              Labels: FT
>
> Currently both FailAction() and DoneAction() in IMRUDriver have the same behavior: they shut
> down all evaluators and return with no error.
> We need to report job failure to the RM (YARN). FailAction seems to be the right place to do it.
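
For illustration only, the distinction described in the issue can be sketched roughly as follows. This is a minimal Python sketch, not the REEF IMRUDriver code: `SketchDriver`, `JobFailedError`, `done_action`, `fail_action` and `run` are all hypothetical names, and the nonzero exit code merely stands in for whatever mechanism would actually report failure to the RM.

```python
class JobFailedError(Exception):
    """Hypothetical error used to signal a failed job to the caller."""


class SketchDriver:
    """Hypothetical stand-in for a driver; not the REEF IMRUDriver API."""

    def __init__(self, evaluators):
        self.evaluators = list(evaluators)
        self.closed = []

    def _shut_down_evaluators(self):
        # Both paths release every evaluator first.
        while self.evaluators:
            self.closed.append(self.evaluators.pop())

    def done_action(self):
        # Success path: clean up and return with no error.
        self._shut_down_evaluators()

    def fail_action(self, reason):
        # Failure path: clean up, then surface the failure instead of
        # returning silently, so the caller can report it upstream.
        self._shut_down_evaluators()
        raise JobFailedError(reason)


def run(driver, succeeded):
    """Return a process-style exit code: 0 on success, 1 on failure."""
    try:
        if succeeded:
            driver.done_action()
        else:
            driver.fail_action("task failed")
    except JobFailedError:
        return 1
    return 0
```

In this sketch both paths still shut down all evaluators; the only difference is that the failure path no longer returns as if the job had succeeded.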



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
