apex-dev mailing list archives

From "Chaitanya (JIRA)" <j...@apache.org>
Subject [jira] [Created] (APEXMALHAR-2158) Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys
Date Mon, 18 Jul 2016 11:08:20 GMT
Chaitanya created APEXMALHAR-2158:
-------------------------------------

             Summary: Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys
                 Key: APEXMALHAR-2158
                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2158
             Project: Apache Apex Malhar
          Issue Type: Bug
            Reporter: Chaitanya
            Assignee: Chaitanya


Failure recovery in the Kafka Input Operator depends on the "offsetStats" map.
This map is updated whenever a message is emitted from the operator.

When the operator goes down and comes back up, it sets this map on the consumer, which starts
reading from these offsets. Because the map holds the offsets of the last emitted messages, the
consumer reads the last emitted message again and the operator emits it a second time.

Solution: Increment each offset in the map by one before setting it on the consumer, so that
reading resumes at the first message that has not yet been emitted.
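
The proposed fix can be sketched roughly as follows. This is only an illustration, not the actual Malhar patch: the class OffsetRecovery and the method resumeOffsets are hypothetical names, and the key type is left generic because the exact type of the "offsetStats" keys is not stated here. It assumes the map stores, per partition, the offset of the last emitted message.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Malhar: it only illustrates the idea of
// advancing the checkpointed offsets before handing them to the consumer.
final class OffsetRecovery {
    private OffsetRecovery() {}

    /**
     * Returns a copy of the recovered map with every offset advanced by one.
     * Assumption: each value is the offset of the last message already emitted
     * for that partition, so (value + 1) is the first message not yet emitted.
     * The input map is left unchanged.
     */
    static <K> Map<K, Long> resumeOffsets(Map<K, Long> lastEmitted) {
        Map<K, Long> resume = new HashMap<>();
        for (Map.Entry<K, Long> entry : lastEmitted.entrySet()) {
            resume.put(entry.getKey(), entry.getValue() + 1);
        }
        return resume;
    }
}
```

On redeploy, the operator would pass the result of resumeOffsets(offsetStats) to the consumer instead of the recovered map itself. Returning a copy keeps the checkpointed state untouched, so the increment is not applied twice if recovery runs again before any new message is emitted.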



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
