Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 12EF9200D32 for ; Sun, 5 Nov 2017 15:31:06 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 114EC160BEB; Sun, 5 Nov 2017 14:31:06 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 572A61609EA for ; Sun, 5 Nov 2017 15:31:05 +0100 (CET) Received: (qmail 99883 invoked by uid 500); 5 Nov 2017 14:31:04 -0000 Mailing-List: contact jira-help@kafka.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@kafka.apache.org Delivered-To: mailing list jira@kafka.apache.org Received: (qmail 99872 invoked by uid 99); 5 Nov 2017 14:31:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 05 Nov 2017 14:31:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 4CC781B0B9D for ; Sun, 5 Nov 2017 14:31:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id CqiahADX8iqI for ; Sun, 5 Nov 2017 14:31:02 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id CBD0D5FD84 for ; Sun, 5 Nov 2017 14:31:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 50DCFE00DF for ; Sun, 5 Nov 2017 14:31:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 080B023F1D for ; Sun, 5 Nov 2017 14:31:00 +0000 (UTC) Date: Sun, 5 Nov 2017 14:31:00 +0000 (UTC) From: "Kostas Christidis (JIRA)" To: jira@kafka.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (KAFKA-6173) Leader should stop accepting requests when disconnected from ZK MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Sun, 05 Nov 2017 14:31:06 -0000 [ https://issues.apache.org/jira/browse/KAFKA-6173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kostas Christidis updated KAFKA-6173: ------------------------------------- Description: h2. Setup 1 consumer (C1), 2 datacenters (DC1, DC2), a ZK ensemble located in DC1, and 3 brokers (B1, B2, B3), where: * B1 is the cluster controller, located in DC1 * B2 is the leader for partition "foo", located in DC2 * B3 is a replica for partition "foo", located in DC1 h2. Condition There is a network partition between DC1 and DC2 h2. Actual behavior B2 will not relinquish leadership and will continue to serve C1. This split-brain situation will go on until C1 refreshes its metadata and finds out about the new leader. h2. Expected behavior B2 should stop accepting requests when it loses the ZK connection. -- I am prioritizing this a "minor" bug because the multi-DC arrangement is not optimal, and we'd be better off using a tool such as MirrorMaker. was: h2. Setup 1 consumer (C1), 2 datacenters (DC1, DC2), a ZK ensemble located in DC1, and 3 brokers (B1, B2, B3), where: * B1 is the cluster controller, located in DC1 * B2 is the leader for partition "foo", located in DC2 * B3 is a replica for partition "foo", located in DC1 h2. Condition There is a network partition between DC1 and DC2 h2. Actual behavior B2 will not relinquish leadership and will continue to serve C1. This split-brain situation will go on until C1 refreshes its metadata and finds out about the new leader. h2. Expected behavior B2 should stop accepting requests when it loses the ZK connection. I am prioritizing this a "minor" bug because the multi-DC arrangement is not optimal, and we'd be better off using a tool such as MirrorMaker. > Leader should stop accepting requests when disconnected from ZK > --------------------------------------------------------------- > > Key: KAFKA-6173 > URL: https://issues.apache.org/jira/browse/KAFKA-6173 > Project: Kafka > Issue Type: Bug > Reporter: Kostas Christidis > Priority: Minor > Attachments: After.png, Before.png, Partition.png > > > h2. Setup > 1 consumer (C1), 2 datacenters (DC1, DC2), a ZK ensemble located in DC1, and 3 brokers (B1, B2, B3), where: > * B1 is the cluster controller, located in DC1 > * B2 is the leader for partition "foo", located in DC2 > * B3 is a replica for partition "foo", located in DC1 > h2. Condition > There is a network partition between DC1 and DC2 > h2. Actual behavior > B2 will not relinquish leadership and will continue to serve C1. This split-brain situation will go on until C1 refreshes its metadata and finds out about the new leader. > h2. Expected behavior > B2 should stop accepting requests when it loses the ZK connection. > -- > I am prioritizing this a "minor" bug because the multi-DC arrangement is not optimal, and we'd be better off using a tool such as MirrorMaker. -- This message was sent by Atlassian JIRA (v6.4.14#64029)