Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 72B80200BC8 for ; Wed, 23 Nov 2016 15:30:07 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 715F4160AFB; Wed, 23 Nov 2016 14:30:07 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id B91EE160AFA for ; Wed, 23 Nov 2016 15:30:06 +0100 (CET) Received: (qmail 70104 invoked by uid 500); 23 Nov 2016 14:30:06 -0000 Mailing-List: contact dev-help@falcon.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@falcon.apache.org Delivered-To: mailing list dev@falcon.apache.org Received: (qmail 70092 invoked by uid 99); 23 Nov 2016 14:30:05 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Nov 2016 14:30:05 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 4FEFBC796C for ; Wed, 23 Nov 2016 14:30:05 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -7.019 X-Spam-Level: X-Spam-Status: No, score=-7.019 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-2.999] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id X8_MLGsETAk7 for ; Wed, 23 Nov 2016 14:30:00 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with SMTP id 5D70E5FAFB for ; Wed, 23 Nov 2016 14:29:59 +0000 (UTC) Received: (qmail 69863 invoked by uid 99); 23 Nov 2016 14:29:58 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Nov 2016 14:29:58 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 6E0D02C03E0 for ; Wed, 23 Nov 2016 14:29:58 +0000 (UTC) Date: Wed, 23 Nov 2016 14:29:58 +0000 (UTC) From: "Ajay Yadava (JIRA)" To: dev@falcon.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (FALCON-1406) Effective time in Entity updates. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 23 Nov 2016 14:30:07 -0000 [ https://issues.apache.org/jira/browse/FALCON-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15690243#comment-15690243 ] Ajay Yadava commented on FALCON-1406: ------------------------------------- {quote} a lot of features developed in falcon took the immutability of *entity definition* for past instances as given {quote} Rerunning old instances doesn't affect that. New entities create new instances and don't affect "old instances", though they might overwrite the result of other entities. {quote}This is a much cleaner way of retaining history than the current scheme {quote} I understand and agree with the motivation, but IMHO, this approach pollutes history. The fact that the overlapping instances ran and had a status and are selectively nuked with this change, is seemingly clean but is actually creating more problems than solving. A better approach in retaining history will be to create entity versioning. The workaround approach that you have suggested now, is the one which doesn't leave "mess" (defunct and similarly named entities) so you don't need a cleanup. However, there are a lot of other cases where there will be such "mess" and a tool can definitely be built to highlight such entities, older than a given time range, and hence can be deleted. It will also be useful if someone takes the approach of not deleting and recreating the entity but to update the entity and reprocess old instances with a backfill job. Both approaches have their own pros and cons and none is ideal. > Effective time in Entity updates. > --------------------------------- > > Key: FALCON-1406 > URL: https://issues.apache.org/jira/browse/FALCON-1406 > Project: Falcon > Issue Type: New Feature > Reporter: sandeep samudrala > Assignee: sandeep samudrala > Attachments: FALCON-1406-initial.patch, effective_time_in_entity_updates.pdf > > > Effective time with entity updates needs to be provided even with past time too. There was effective time capability provided in the past which gives the functionality to set an effective time for an entity with only current or future time(now + delay), which could not solve all the issues. > Following are few scenarios which would require effective time to be available with time back in past. > a) New code being deployed for an incompatible input data set which would leave instances with old code and new data. > b) Bad code being pushed for which, the entity should be able to go back in time to replay(rerun) with new code. > c) Orchestration level changes(good/bad) would need functionality to go back in time to start with. > For reference: Linking all the Jiras that have been worked upon around effective time . > https://issues.apache.org/jira/browse/FALCON-374 > https://issues.apache.org/jira/browse/FALCON-297 -- This message was sent by Atlassian JIRA (v6.3.4#6332)