falcon-dev mailing list archives

From "Pallavi Rao (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FALCON-1835) Falcon should do coord rerun rather than workflow rerun to avoid parallel runs of same instance:
Date Fri, 19 Feb 2016 09:11:18 GMT

    [ https://issues.apache.org/jira/browse/FALCON-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15153961#comment-15153961 ]

Pallavi Rao commented on FALCON-1835:
-------------------------------------

Investigated moving to coord rerun and found the following issues/requirements:
1. We need to move to Oozie 4.2 (we currently use 4.1), as the coord rerun API has been
enhanced to support rerunning only the failed nodes.
2. The post-processing notification does not send the parent coord action id, so retry uses
workflow rerun. However, with Oozie notifications, the corresponding coord action id can be
obtained and a coord rerun can be performed. So, I will modify the code to do a coord rerun
when the action id is available and a workflow rerun when it is not.
3. Oozie coord rerun does not honor the oozie.wf.rerun.skip.nodes property that workflow rerun
honors. This is a slight compatibility break, as Falcon used to support this option, although
it was hardly used. I will file a bug for this.
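
The fallback described in item 2 can be sketched roughly as below. This is only an illustration of the decision, not the actual Falcon change: the class RerunPlanner, its nested types, and the plan method are hypothetical names, and the ids in main are made up. No Oozie client calls are made; the sketch only picks which kind of rerun to request and on which id.

```java
// Hypothetical sketch: choose between coord rerun and workflow rerun.
// None of these names come from the Falcon or Oozie code bases.
public class RerunPlanner {

    /** The two rerun styles discussed in the comment above. */
    public enum RerunType { COORD_ACTION, WORKFLOW }

    /** What to rerun and how. */
    public static final class RerunPlan {
        public final RerunType type;
        public final String targetId;

        RerunPlan(RerunType type, String targetId) {
            this.type = type;
            this.targetId = targetId;
        }

        @Override
        public String toString() {
            return type + " rerun of " + targetId;
        }
    }

    /**
     * Prefer a coord rerun when the parent coord action id is known
     * (for example, taken from an Oozie notification); otherwise fall
     * back to a workflow rerun on the workflow id.
     */
    public static RerunPlan plan(String workflowId, String parentCoordActionId) {
        if (parentCoordActionId != null && !parentCoordActionId.trim().isEmpty()) {
            return new RerunPlan(RerunType.COORD_ACTION, parentCoordActionId);
        }
        return new RerunPlan(RerunType.WORKFLOW, workflowId);
    }

    public static void main(String[] args) {
        // Made-up ids, for illustration only.
        System.out.println(plan("example-wf-id", "example-coord-id@5"));
        System.out.println(plan("example-wf-id", null));
    }
}
```

Keeping the choice in one place means the retry path that only knows the workflow id keeps working unchanged, while callers that do have the coord action id get the coord rerun.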



> Falcon should do coord rerun rather than workflow rerun to avoid parallel runs of same instance:
> ------------------------------------------------------------------------------------------------
>
>                 Key: FALCON-1835
>                 URL: https://issues.apache.org/jira/browse/FALCON-1835
>             Project: Falcon
>          Issue Type: Improvement
>            Reporter: Pallavi Rao
>            Assignee: Pallavi Rao
>              Labels: backward-incompatible
>
> As of today, Falcon does a workflow rerun, which can cause parallel instances to run.
> 1) Rerun via Falcon retry.
> 2) Coord rerun via direct Oozie command line.
> Falcon needs to move to coord rerun so that the rerun applies to the same workflow that is already present.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
