reef-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Chung (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (REEF-1343) Fix events received in case of evaluator failure
Date Wed, 20 Apr 2016 21:03:25 GMT

     [ https://issues.apache.org/jira/browse/REEF-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Andrew Chung reassigned REEF-1343:
----------------------------------

    Assignee: Andrew Chung

> Fix events received in case of evaluator failure
> ------------------------------------------------
>
>                 Key: REEF-1343
>                 URL: https://issues.apache.org/jira/browse/REEF-1343
>             Project: REEF
>          Issue Type: Bug
>          Components: REEF.NET
>            Reporter: Mariia Mykhailova
>            Assignee: Andrew Chung
>            Priority: Critical
>              Labels: FT
>
> Investigation of REEF-1325 shows a weird sequence of events on local runtime: 
> * evaluator crashes with an unhandled exception (shown in evaluator.stderr and .stdout
files).
> * driver receives {{IFailedEvaluator}} event which doesn't have associated {{FailedTask}}
object.
> * the task continues running and completes successfully
> * driver receives {{ICompletedTask}} event.
> By design, failed evaluator shouldn't allow for a successful task completion.
> This can be reproduced using {{TestPoisonedEvaluatorStartHanlder}} test.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message