Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A004A200D64 for ; Mon, 11 Dec 2017 14:58:11 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 9ECBB160C13; Mon, 11 Dec 2017 13:58:11 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id BC50B160C10 for ; Mon, 11 Dec 2017 14:58:10 +0100 (CET) Received: (qmail 41253 invoked by uid 500); 11 Dec 2017 13:58:09 -0000 Mailing-List: contact users-help@nifi.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@nifi.apache.org Delivered-To: mailing list users@nifi.apache.org Received: (qmail 41243 invoked by uid 99); 11 Dec 2017 13:58:09 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Dec 2017 13:58:09 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 379CAC32C6 for ; Mon, 11 Dec 2017 13:58:09 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.121 X-Spam-Level: X-Spam-Status: No, score=-0.121 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id x650Dq5Mv4wf for ; Mon, 11 Dec 2017 13:58:08 +0000 (UTC) Received: from mail-oi0-f53.google.com (mail-oi0-f53.google.com [209.85.218.53]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 231545FB5C for ; Mon, 11 Dec 2017 13:58:08 +0000 (UTC) Received: by mail-oi0-f53.google.com with SMTP id r63so11597644oia.6 for ; Mon, 11 Dec 2017 05:58:08 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-transfer-encoding; bh=4BAX3/SEd5baIv7ZDGGqratLknjBRSrbqGDxzmkjWdU=; b=i32rU/g3FORQnYmGVnpmcHRxx+x7RP1Dw8E7NEt/iL8giBGlNYNKWzGkTPcvKOSQGP 6NZ5uM0y0FGl6SUFnCDTek4S1YjIEkxeozOlT4J+aisXmDcg+BrLxsy9Tne+siYo3nbQ +EASVdlezyl9PKUAVuX0hUf/ILs1G02HFpvWDPiwkLLJn3jcDTHJYHmo7Kru0azsWPUx VjMElOCGv1eFMFf7kqt96IbKEkpVY4xK4z4WmtX/dsn1MWpSLFpYh7JxKpHZ0NaQtCKz t6OTKEeM/iKNseMfoM7oRlWXd5JUYmJgp5JuDJHKunYIo2HCR19gV6EnDOHVKAuIdev0 2iDA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-transfer-encoding; bh=4BAX3/SEd5baIv7ZDGGqratLknjBRSrbqGDxzmkjWdU=; b=cOONn/2vKYQd0sockE3FkHx0o8YgGlCLTPmeCEYfmXUN/RQOQs/B95JUQ8AeSeQyeU eaS1wJdj4k3YNnRpmMqyta2hFBCfU+594Dvhnx3TfetOo4i0MUT8qT2QCRHDno6aYhPx /1SwhS5IUe+yZX0yBtLk2DBPBKPd4pAi731vinO5OKEvgUWq0WpVi0FPloJpERImMwzz 1WldikLK1vOjwFSBfMJW6uJ0Wcqj92qAtpSYH70p0QtGka/q71f3a+w+/6HK7GHfiMyS YWmTnpYV3QX8JhRAKUv/rfuOE/NsBX6Th5NRDDgcNScGj7L1rXMx9eAJe2QTllpMUA68 tivw== X-Gm-Message-State: AKGB3mI+tfDZSGDNDJ73VdSbIrNFU0cjJRnzMmJkWwb33JIiBnQHazos G1QEJgVzRs5wRBEcCIb6XRZQPV1SsOXW4ftH2H9YIQ== X-Google-Smtp-Source: ACJfBovaT3MOjkYK8Ca9h/ZLp9v6PAuqBfYno1sMxP7ySswkF8idbVDzFVzxbqN5jq7WulTeIu9YghvKwvhq7pmOwNg= X-Received: by 10.202.67.65 with SMTP id q62mr350028oia.37.1513000687062; Mon, 11 Dec 2017 05:58:07 -0800 (PST) MIME-Version: 1.0 Received: by 10.157.43.232 with HTTP; Mon, 11 Dec 2017 05:58:06 -0800 (PST) In-Reply-To: References: From: Joe Witt Date: Mon, 11 Dec 2017 08:58:06 -0500 Message-ID: Subject: Re: Drop event disordering To: users@nifi.apache.org Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable archived-at: Mon, 11 Dec 2017 13:58:11 -0000 Omer It can certainly be looked at in time for 1.5 but based on recent progress/discussion 1.5 may well kick off rather soon. It is something someone would need to take up and that someone need not be a committer at this point. Your analysis of the different provenanace repositories having no real impact is helpful though as it might suggest the issue lies above during the generation of the events so that should help narrow it down. Thanks Joe On Mon, Dec 11, 2017 at 8:51 AM, Omer Hadari wrote: > Hi, we have looked into it some more and tried using > WriteAheadProvenanceRepository, to no real avail. Sorry for the ping, but= do > you have any other ideas? We don=E2=80=99t mind pursuing them and this pr= oblem is > critical for us. Any chance this could be looked at for 1.5? > > Thanks! > > On Fri, 24 Nov 2017 at 13:19 Omer Hadari wrote: >> >> Opened an issue >> NIFI-4638 >> >> On Wed, 22 Nov 2017 at 20:39 Omer Hadari wrote: >>> >>> Yes we were thinking in that direction as well, that=E2=80=99s why I me= ntioned >>> the 1ms part. I do not know how events are assigned an ordinal though, = so >>> it=E2=80=99s unclear to me whether the disordering is constant but is u= sually >>> =E2=80=9Chidden=E2=80=9D since there is no rollover, or maybe there is = some kind of a race >>> condition. You=E2=80=99ll probably be able to answer these questions qu= icker than >>> me, I=E2=80=99ll open a jira as soon as I get home later today. >>> >>> Thanks again! >>> >>> On Wed, 22 Nov 2017 at 20:34 Mark Payne wrote: >>>> >>>> Omer, >>>> >>>> Yes, I think that is sufficient. I think the issue is that the framewo= rk >>>> is creating both the >>>> ATTRIBUTES_MODIFIED and DROP events, and the generation of these objec= ts >>>> is >>>> very fast. But if the timestamp happens to 'rollover' from millisecond= 1 >>>> to millisecond 2, >>>> for example, those events get different timestamps. So I think it's ju= st >>>> a timing thing that >>>> will be somewhat difficult to reproduce reliably. But just a descripti= on >>>> of the behavior that >>>> you're experiencing should be fine. >>>> >>>> Thanks >>>> -Mark >>>> >>>> On Nov 22, 2017, at 1:04 PM, Omer Hadari wrote= : >>>> >>>> I=E2=80=99ll be glad to open a jira, though the problem is hardly cohe= rent imo, >>>> what would you like to see there? Simply =E2=80=9CDisordering of drop = events=E2=80=9D and >>>> the explanation I have here? Sadly I cannot provide a concrete example= since >>>> the problem does not reproduce. >>>> >>>> On Wed, 22 Nov 2017 at 18:23 Joe Witt wrote: >>>>> >>>>> also - awesome find! And glad you're at such a level with provenance >>>>> data to catch that. Thanks Omer! >>>>> >>>>> On Wed, Nov 22, 2017 at 11:21 AM, Mark Payne >>>>> wrote: >>>>> > Omer, >>>>> > >>>>> > This is likely an issue related to the order in which we generate >>>>> > those events in the framework. >>>>> > Do you mind filing a JIRA? >>>>> > >>>>> > Thanks >>>>> > -Mark >>>>> > >>>>> > >>>>> >> On Nov 22, 2017, at 10:51 AM, Omer Hadari >>>>> >> wrote: >>>>> >> >>>>> >> Hi! >>>>> >> We=E2=80=99ve been using NiFi for a while now, and we save all pro= venance >>>>> >> events for logging purposes and such. We encountered an issue whil= e looking >>>>> >> at lineages of some flow files, which showed drop events as if the= y happened >>>>> >> before another event, that in fact preceded it (and indeed has a l= ower event >>>>> >> ordinal). >>>>> >> >>>>> >> For example in a split json processor, the original FlowFile is >>>>> >> dropped after all splits happen and are assigned fragment counts, = but still >>>>> >> the timestamp of the drop event is earlier than the timestamp of t= he >>>>> >> attributes modified event. That causes the graph to look as if the >>>>> >> attributes modified event comes out of the drop event, which doesn= =E2=80=99t really >>>>> >> make sense to us (should it?). It=E2=80=99s probably worth noting = that the drop >>>>> >> event ordinal is higher than the attributes modified event ordinal= . Also we >>>>> >> noticed that >>>>> >> 1. This only happens every once per a few thousand events. >>>>> >> 2. This does not reproduce by replaying. >>>>> >> 3. The drop event=E2=80=99s timestamp is earlier by 1ms in the cas= es we >>>>> >> encountered, and the ordinal is always larger by one. >>>>> >> >>>>> >> This might be an error with the split json processor or a more >>>>> >> general one. We=E2=80=99d love any clues or corrections to misconc= eptions we might >>>>> >> have (maybe this is not a problem and drop events can precede othe= r events?) >>>>> >> >>>>> >> Thank you! >>>>> > >>>> >>>> >