apex-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Siyuan Hua (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (APEXMALHAR-2158) Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys
Date Wed, 10 Aug 2016 06:22:20 GMT

     [ https://issues.apache.org/jira/browse/APEXMALHAR-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Siyuan Hua updated APEXMALHAR-2158:
-----------------------------------
    Fix Version/s: 3.5.0

> Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys
> --------------------------------------------------------------------------------
>
>                 Key: APEXMALHAR-2158
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2158
>             Project: Apache Apex Malhar
>          Issue Type: Bug
>            Reporter: Chaitanya
>            Assignee: Chaitanya
>             Fix For: 3.5.0
>
>
> Kafka Input Operator failure recovery depends on offsetStats map.
> "offsetStats" map is updated whenever the message is emitted from operator.
> When the operator goes down and comes back, operator sets this map to the consumer and
start reading from these offsets. Consumer reads the last emitted message again and operator
emits it.
> Solution: Increment the offsets before setting to the consumer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message