Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 33C78200C39 for ; Thu, 16 Mar 2017 23:28:39 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 32589160B78; Thu, 16 Mar 2017 22:28:39 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 7D39E160B72 for ; Thu, 16 Mar 2017 23:28:38 +0100 (CET) Received: (qmail 86684 invoked by uid 500); 16 Mar 2017 22:28:35 -0000 Mailing-List: contact dev-help@nifi.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@nifi.apache.org Delivered-To: mailing list dev@nifi.apache.org Received: (qmail 86635 invoked by uid 99); 16 Mar 2017 22:28:35 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Mar 2017 22:28:35 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id AFE34C06AD for ; Thu, 16 Mar 2017 22:28:34 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.722 X-Spam-Level: X-Spam-Status: No, score=-0.722 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id a8hO7tF-uFWW for ; Thu, 16 Mar 2017 22:28:32 +0000 (UTC) Received: from mout.web.de (mout.web.de [212.227.17.11]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 574B95FC7B for ; Thu, 16 Mar 2017 22:28:32 +0000 (UTC) Received: from [92.74.244.213] by 3capp-webde-bap37.server.lan (via HTTP); Thu, 16 Mar 2017 23:28:31 +0100 MIME-Version: 1.0 Message-ID: From: "Uwe Geercken" To: "Nifi Dev" Subject: Processor logic Content-Type: text/plain; charset=UTF-8 Date: Thu, 16 Mar 2017 23:28:31 +0100 Importance: normal Sensitivity: Normal X-Priority: 3 X-Provags-ID: V03:K0:zDkA0/KRDELg5o/XKaRL3Wzc4k9iEj/EdAJix/msX9s nCATB5Z5lj/rMCpuES0JZMQ/sFUjSt2GvtGK4+jQK4eB6pphzf zDOoC1n3FpuhhXhLrD4zJriSzRBclEQQi/wjidfOYlJFBktQuQ Safvef3Q7I3JfXbwb2n5mhjWXMX2L+dsBwIp9VM1XC7V1gYsKS 6UZcRTfeAgn67dbpyDSVxqWZvnPppvfgGuIG5peopsE4B8q4zi MjaefXAVlo0PXO2me1LBWCbAlu10j6RYg0TbNC+dtJ/nuNn0Zf yauONbTAMbj+F1op87d1pCQiZ9O X-UI-Out-Filterresults: notjunk:1;V01:K0:dg6YPA2QfEs=:6cII9wHgfAnpUWbxBFoo0m iwNUZ6/Y793Q4bP6Wcxr75ud4drvWWiJsJqcR+Wy3nFSakBT7mlQmRseoty+oZk32fvqTv+dr u7AlxToX+DHnBn3t9dBUnyG0VM7PS6vWYefm7OGcBCGnSkM4WNECMKUKRt+W7rjL3mkRxezoM l4qpicjoJLC2Nxm0LJR1CAhdy1nTm/RZlQeWEErse/Ola+jb/rPqG/UqCn/UQzd/He4Hd+CGw BMyuRVnMCMnhaiE00yUlxjxJPQxAOPu2mMLaqVabQPLrHrW69Grosx94AT4dOzU3h/d6kci+P dEs+zDRzgqSIz3627WZJRVNVusDpNcugb/56AvkM9tvxDJi1ATLzqA/B0xp15q86BUVjGACiM Lhf/3xRt+y/jrDLvxJ0Kw4C6yBP1lhDKGHHZfnqVH9iwQ0FOpTpvNljglM63fGE74TGzwF+Dx PmQwTX+Opg== archived-at: Thu, 16 Mar 2017 22:28:39 -0000 Hello, I have a little bit of a hard time to design processors correctly. I find it difficult to decide if a processor should e.g. process a single line from a flow file or process also flow files with multiples lines of data (e.g. in the case of CSV files). Another point is the handling of header rows. One other point is data provenenance events: what is the correct event I should use when modifying attributes, content or both? Is there a guide which outlines the best practices for such cases? I have the feeling that many of the processors handle these issues quite differently. I think there either should be a sort of standard or otherwise it should be well documented. And although there is very good documentation available for the project, for some of the processors one has to play around quite a bit to get it right because they behave differently or have a different philosophie and one has to understand it first to get it right. Would appreciate to get some feedback and advice or pointers to documentation. Uwe