Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8CBFC200B41 for ; Thu, 7 Jul 2016 19:11:17 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 8B5ED160A4F; Thu, 7 Jul 2016 17:11:17 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id CE5DD160A68 for ; Thu, 7 Jul 2016 19:11:16 +0200 (CEST) Received: (qmail 29170 invoked by uid 500); 7 Jul 2016 17:11:15 -0000 Mailing-List: contact dev-help@apex.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@apex.apache.org Delivered-To: mailing list dev@apex.apache.org Received: (qmail 29093 invoked by uid 99); 7 Jul 2016 17:11:15 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2016 17:11:15 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 45333C05A7 for ; Thu, 7 Jul 2016 17:11:15 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -4.507 X-Spam-Level: X-Spam-Status: No, score=-4.507 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-1.287] autolearn=disabled Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id pMtEpMfqhmXA for ; Thu, 7 Jul 2016 17:11:13 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with SMTP id 3D1A35F237 for ; Thu, 7 Jul 2016 17:11:12 +0000 (UTC) Received: (qmail 26356 invoked by uid 99); 7 Jul 2016 17:11:11 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2016 17:11:11 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 29A2B2C02B0 for ; Thu, 7 Jul 2016 17:11:11 +0000 (UTC) Date: Thu, 7 Jul 2016 17:11:11 +0000 (UTC) From: "ASF GitHub Bot (JIRA)" To: dev@apex.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (APEXMALHAR-2086) Kafka Output Operator with Kafka 0.9 API MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 07 Jul 2016 17:11:17 -0000 [ https://issues.apache.org/jira/browse/APEXMALHAR-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15366420#comment-15366420 ] ASF GitHub Bot commented on APEXMALHAR-2086: -------------------------------------------- GitHub user sandeshh reopened a pull request: https://github.com/apache/apex-malhar/pull/298 [APEXMALHAR-2086] Kafka output operator: 0.9.0 Kafka output exactly once operator and the regular output operator. You can merge this pull request into a Git repository by running: $ git pull https://github.com/sandeshh/apex-malhar APEXMALHAR-2086 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/apex-malhar/pull/298.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #298 ---- commit 6bf316da76bf1d1871f52a114a307c05ff652376 Author: sandeshh Date: 2016-05-25T15:56:56Z Kafka 0.9.0 output operators and unit tests. 1. Abstract Base class 2. Kafka Output operator 3. Exactly Once output operator Key in the Kafka message is used by the operator to track the tuples written by it. ---- > Kafka Output Operator with Kafka 0.9 API > ---------------------------------------- > > Key: APEXMALHAR-2086 > URL: https://issues.apache.org/jira/browse/APEXMALHAR-2086 > Project: Apache Apex Malhar > Issue Type: New Feature > Reporter: Sandesh > Assignee: Sandesh > > Goal : 2 Operartors for Kafka Output > 1. Simple Kafka Output Operator > - Supports Atleast Once > - Expose most used producer properties as class properties > 2. Exactly Once Kafka Output ( Not possible in all the cases, will be documented later ) > > Design for Exactly Once > Window Data Manager - Stores the Kafka partitions offsets. > Kafka Key - Used by the operator = AppID#OperatorId > During recovery. Partially written window is re-created using the following approach: > Tuples between the largest recovery offsets and the current offset are checked. Based on the key, tuples written by the other entities are discarded. > Only tuples which are not in the recovered set are emitted. > Tuples needs to be unique within the window. > -- This message was sent by Atlassian JIRA (v6.3.4#6332)