Date: Tue, 14 Nov 2017 09:38:00 +0000 (UTC)
From: "Andrey (JIRA)"
To: jira@kafka.apache.org
Subject: [jira] [Issue Comment Deleted] (KAFKA-5504) Kafka controller is not getting elected

     [ https://issues.apache.org/jira/browse/KAFKA-5504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrey updated KAFKA-5504:
--------------------------
    Comment: was deleted

(was: zkCli shows:
- broker 1 is the controller
- several seconds later broker 2 is the controller
- several seconds later broker 1 or broker 3 is the controller.

{code}
[zk: host1:2181(CONNECTED) 18] get /kafka-prod/controller
{"version":1,"brokerid":1,"timestamp":"1510651771022"}
{code}

{code}
[zk: host2:2181(CONNECTED) 19] get /kafka-prod/controller
{"version":1,"brokerid":2,"timestamp":"1510651882077"}
{code}

Looks like some kind of double locking error on zk resources.
)

> Kafka controller is not getting elected
> ---------------------------------------
>
>                 Key: KAFKA-5504
>                 URL: https://issues.apache.org/jira/browse/KAFKA-5504
>             Project: Kafka
>          Issue Type: Bug
>          Components: controller
>    Affects Versions: 0.9.0.1
>            Reporter: Ashish Kumar
>
> I have a Kafka cluster of 20 nodes. It had been showing under-replicated topics for the last few days, so I decided to restart the broker that was acting as the controller. After the restart, all brokers log the messages below (it seems the controller is never finalized and leader election is happening continuously):
> [2017-06-23 02:59:50,388] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:50,396] INFO New leader is 12 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:50,410] INFO Rolled new log segment for 'dpe_feedback_rating_history-4' in 0 ms. (kafka.log.Log)
> [2017-06-23 02:59:51,585] INFO Creating /controller (is it secure? false) (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:51,590] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:51,609] INFO New leader is 11 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:52,792] INFO Creating /controller (is it secure? false) (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:52,799] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:52,808] INFO New leader is 12 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:54,122] INFO New leader is 3 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:55,504] INFO Creating /controller (is it secure? false) (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:55,512] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:55,520] INFO New leader is 11 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:56,695] INFO Creating /controller (is it secure? false) (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:56,701] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:56,709] INFO New leader is 11 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:57,949] INFO Creating /controller (is it secure? false) (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:57,955] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:57,965] INFO New leader is 12 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> [2017-06-23 02:59:59,378] INFO Creating /controller (is it secure? false) (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:59,384] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
> [2017-06-23 02:59:59,395] INFO New leader is 12 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
> .
> .
> .
> I tried deleting the controller znode (/controller), but with no luck. Please let me know if any fix is possible here.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
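
To put a number on the flapping described in the issue, the "New leader is N" lines in a broker log can be counted. The following is only a rough sketch, not part of Kafka or of this ticket: the function names (leader_events, summarize) and the sample variable are made up for illustration, and the regular expression assumes the log line format shown in the issue description. The sample lines are copied from that description.

```python
import re
from collections import Counter

# Assumed format, taken from the log excerpt in the issue:
# [2017-06-23 02:59:50,396] INFO New leader is 12 (kafka.server...)
LEADER_RE = re.compile(r"^\[(?P<ts>[^\]]+)\] INFO New leader is (?P<broker>\d+) ")

# A few lines copied from the issue description.
SAMPLE_LOG = """
[2017-06-23 02:59:50,388] INFO Result of znode creation is: NODEEXISTS (kafka.utils.ZKCheckedEphemeral)
[2017-06-23 02:59:50,396] INFO New leader is 12 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
[2017-06-23 02:59:51,609] INFO New leader is 11 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
[2017-06-23 02:59:52,808] INFO New leader is 12 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
[2017-06-23 02:59:54,122] INFO New leader is 3 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener)
"""


def leader_events(lines):
    """Return (timestamp, broker_id) for every 'New leader is N' line."""
    events = []
    for line in lines:
        # Tolerate mail-style "> " quoting in front of the log line.
        match = LEADER_RE.match(line.lstrip("> ").strip())
        if match:
            events.append((match.group("ts"), int(match.group("broker"))))
    return events


def summarize(events):
    """Count announcements, actual controller changes, and per-broker totals."""
    brokers = [broker for _, broker in events]
    changes = sum(1 for prev, cur in zip(brokers, brokers[1:]) if prev != cur)
    return {
        "announcements": len(brokers),
        "changes": changes,
        "per_broker": Counter(brokers),
    }


if __name__ == "__main__":
    print(summarize(leader_events(SAMPLE_LOG.splitlines())))
```

In a stable cluster one would expect the controller to change rarely, so several changes within a few seconds, as in the excerpt, point to repeated re-election rather than a single failover.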