reef-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shravan Matthur Narayanamurthy (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (REEF-1674) Random Failures in Broadcast and Reduce Fault Tolerance tests
Date Thu, 01 Dec 2016 00:58:58 GMT

    [ https://issues.apache.org/jira/browse/REEF-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15710396#comment-15710396
] 

Shravan Matthur Narayanamurthy edited comment on REEF-1674 at 12/1/16 12:58 AM:
--------------------------------------------------------------------------------

Mariia, you were right. I was not sure if {{PoisonedEventHandler}} causes failed evaluator
or not. But after looking at it, it seems to, as it raises exception on the clock. So removed
the exit handler and used it instead. Thanks!


was (Author: shravanmn):
Mariia, you were right. I was not sure if {{PoisonedEventHandler}} causes failed evaluator
or not. But after looking at it, it seems to as it raises exception on the clock. So removed
the exit handler and used it instead. Thanks!

> Random Failures in Broadcast and Reduce Fault Tolerance tests
> -------------------------------------------------------------
>
>                 Key: REEF-1674
>                 URL: https://issues.apache.org/jira/browse/REEF-1674
>             Project: REEF
>          Issue Type: Improvement
>          Components: REEF.NET IO
>    Affects Versions: 0.16
>            Reporter: Shravan Matthur Narayanamurthy
>            Assignee: Shravan Matthur Narayanamurthy
>            Priority: Minor
>             Fix For: 0.16
>
>
> The current fault tolerance tests inject simulated failure in a controlled manner and
hence are not the right failure model to test our fault tolerance work. It would be good to
have failures injected randomly than only at specific points as is done in the current code.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message