falcon-dev mailing list archives

From "Peeyush Bishnoi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FALCON-1433) Falcon retry instance do not proceed on failure
Date Wed, 09 Sep 2015 13:58:45 GMT

    [ https://issues.apache.org/jira/browse/FALCON-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736885#comment-14736885 ]

Peeyush Bishnoi commented on FALCON-1433:
-----------------------------------------

On debugging, I found that this issue happens because the parameter "oozie.wf.rerun.failnodes"
is set to "true" by default. As a result, when a failed Falcon instance is rerun, only the
failed workflow user action is invoked and the corresponding post-processing action is not.
Because post-processing is not invoked, the rerun service is not launched when the action
fails again, and hence no further retry attempts happen.

So, to ensure that all retry attempts happen when the workflow user action fails, we need
to set "oozie.wf.rerun.failnodes" to "false".
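
As a rough illustration of the proposed change, here is a minimal sketch of how the rerun
properties could be prepared before they are handed to Oozie. The class RerunProps and the
helper forFullRerun are hypothetical names used only for this example, not existing Falcon
code; only the property name "oozie.wf.rerun.failnodes" comes from the analysis above.

```java
import java.util.Properties;

// Hypothetical helper class for illustration; not part of Falcon or Oozie.
class RerunProps {
    // Property name taken from the comment above.
    static final String RERUN_FAIL_NODES = "oozie.wf.rerun.failnodes";

    // Copies the given properties and forces the flag to "false" so that the
    // rerun is not limited to the failed nodes.
    static Properties forFullRerun(Properties base) {
        Properties props = new Properties();
        props.putAll(base);
        props.setProperty(RERUN_FAIL_NODES, "false");
        return props;
    }

    public static void main(String[] args) {
        Properties base = new Properties();
        base.setProperty(RERUN_FAIL_NODES, "true"); // the default described above
        Properties rerun = forFullRerun(base);
        System.out.println(RERUN_FAIL_NODES + "=" + rerun.getProperty(RERUN_FAIL_NODES));
    }
}
```

The helper copies the input rather than mutating it, so the original job properties stay
untouched if the same configuration is reused elsewhere.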

Thoughts please.

> Falcon retry instance do not proceed on failure
> -----------------------------------------------
>
>                 Key: FALCON-1433
>                 URL: https://issues.apache.org/jira/browse/FALCON-1433
>             Project: Falcon
>          Issue Type: Bug
>          Components: rerun
>            Reporter: Peeyush Bishnoi
>            Assignee: Peeyush Bishnoi
>            Priority: Critical
>             Fix For: trunk
>
>
> When retry is specified in a Falcon entity with more than one attempt and the workflow
> fails, the retried Falcon instance workflow appears to get stuck (if the action fails
> again), and no further retry attempts are made.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
