Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 82A27200CFD for ; Thu, 17 Aug 2017 06:20:06 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 80C1916A34D; Thu, 17 Aug 2017 04:20:06 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C7FA616A34E for ; Thu, 17 Aug 2017 06:20:05 +0200 (CEST) Received: (qmail 82918 invoked by uid 500); 17 Aug 2017 04:20:04 -0000 Mailing-List: contact jira-help@kafka.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@kafka.apache.org Delivered-To: mailing list jira@kafka.apache.org Received: (qmail 82400 invoked by uid 99); 17 Aug 2017 04:20:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 Aug 2017 04:20:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 632F1C046E for ; Thu, 17 Aug 2017 04:20:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id MLrLQcVAFTdF for ; Thu, 17 Aug 2017 04:20:02 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 354535FE6D for ; Thu, 17 Aug 2017 04:20:02 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 8DAA0E0D92 for ; Thu, 17 Aug 2017 04:20:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 819D025391 for ; Thu, 17 Aug 2017 04:20:00 +0000 (UTC) Date: Thu, 17 Aug 2017 04:20:00 +0000 (UTC) From: "Apurva Mehta (JIRA)" To: jira@kafka.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (KAFKA-5403) Transactions system test should dedup consumed messages by offset MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 17 Aug 2017 04:20:06 -0000 [ https://issues.apache.org/jira/browse/KAFKA-5403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16129866#comment-16129866 ] Apurva Mehta edited comment on KAFKA-5403 at 8/17/17 4:19 AM: -------------------------------------------------------------- I think we should punt on this. The problems with the patch are not easy to fix as the {{VerifiableConsumer}} validates that offsets are sequential, which is not true when you have transactions. And the {{ConsoleConsumer}} doesn't expose offsets. So modifying either without breaking compatibility will take time. Also, the system test has been running reliably for months without suffering any problems with duplicate reads on the same offset. was (Author: apurva): I think we should punt on this. The problems with the patch are not easy to fix as the {{VerifiableConsumer}} validates that offsets are sequential, which is not true when you have transactions. And the {{ConsoleConsumer}} doesn't expose offsets. So modifying either breaking compatibility will take time. Also, the system test has been running reliably for months without suffering any problems with duplicate reads on the same offset. > Transactions system test should dedup consumed messages by offset > ----------------------------------------------------------------- > > Key: KAFKA-5403 > URL: https://issues.apache.org/jira/browse/KAFKA-5403 > Project: Kafka > Issue Type: Bug > Affects Versions: 0.11.0.0 > Reporter: Apurva Mehta > Assignee: Apurva Mehta > Fix For: 1.0.0 > > > In KAFKA-5396, we saw that the consumers which verify the data in multiple topics could read the same offsets multiple times, for instance when a rebalance happens. > This would detect spurious duplicates, causing the test to fail. We should dedup the consumed messages by offset and only fail the test if we have duplicate values for a if for a unique set of offsets. -- This message was sent by Atlassian JIRA (v6.4.14#64029)