Date: Tue, 4 Jul 2017 08:15:00 +0000 (UTC)
From: "Pengwei (JIRA)"
To: jira@kafka.apache.org
Subject: [jira] [Commented] (KAFKA-5553) Delete topic failed to change from OnlineReplica to ReplicaDeletionStarted if ISR not created

    [ https://issues.apache.org/jira/browse/KAFKA-5553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16073291#comment-16073291 ]

Pengwei commented on KAFKA-5553:
--------------------------------

One possible fix for this issue: in ReplicaStateMachine.handleStateChange, when the target state is OfflineReplica and controllerContext.partitionLeadershipInfo does not contain the topic partition, still change the replica's state to OfflineReplica.

Is this approach OK? [~junrao] [~ijuma]
Maybe I can try to fix it.

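
To make the proposal concrete, here is a minimal toy model in Python. It is not Kafka code: the class `ReplicaStateMachineModel`, the flag `tolerate_missing_leadership`, and the helper `start_replica_deletion` are all hypothetical names made up for illustration. It assumes that, today, a replica stays in OnlineReplica when the partition has no entry in the leadership info, which is what the report implies; the real controller logic has more states and side effects than shown here.

```python
from enum import Enum


class ReplicaState(Enum):
    ONLINE = "OnlineReplica"
    OFFLINE = "OfflineReplica"
    DELETION_STARTED = "ReplicaDeletionStarted"


class ReplicaStateMachineModel:
    """Toy stand-in for the controller's replica state machine (not Kafka code)."""

    def __init__(self, leadership_info, tolerate_missing_leadership):
        # Set of (topic, partition) pairs that have leader/ISR information.
        self.leadership_info = set(leadership_info)
        # Hypothetical switch: True models the fix proposed in this comment.
        self.tolerate_missing_leadership = tolerate_missing_leadership
        # Maps (topic, partition, broker_id) to the replica's current state.
        self.replica_state = {}

    def handle_state_change(self, replica, target):
        topic, partition, _broker_id = replica
        current = self.replica_state[replica]
        if target is ReplicaState.OFFLINE:
            known = (topic, partition) in self.leadership_info
            if known or self.tolerate_missing_leadership:
                self.replica_state[replica] = ReplicaState.OFFLINE
            # Otherwise the state is left untouched (assumed current behaviour).
        elif target is ReplicaState.DELETION_STARTED:
            # Mirrors the precondition quoted in the assertion message.
            if current is not ReplicaState.OFFLINE:
                raise AssertionError(
                    f"Replica {replica} should be in {ReplicaState.OFFLINE.value} "
                    f"before moving to {target.value}; "
                    f"instead it is in {current.value}"
                )
            self.replica_state[replica] = ReplicaState.DELETION_STARTED
        else:
            raise ValueError(f"unsupported target state in this sketch: {target}")


def start_replica_deletion(machine, replica):
    """Run the two transitions that topic deletion requests, in order."""
    machine.handle_state_change(replica, ReplicaState.OFFLINE)
    machine.handle_state_change(replica, ReplicaState.DELETION_STARTED)
    return machine.replica_state[replica]
```

With an empty `leadership_info` and `tolerate_missing_leadership=False`, `start_replica_deletion` raises the same kind of AssertionError as in the log quoted in this issue; with the flag set to `True`, the replica reaches ReplicaDeletionStarted.
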
> Delete topic failed to change from OnlineReplica to ReplicaDeletionStarted if ISR not created
> ----------------------------------------------------------------------------------------------
>
>                 Key: KAFKA-5553
>                 URL: https://issues.apache.org/jira/browse/KAFKA-5553
>             Project: Kafka
>          Issue Type: Bug
>          Components: controller
>    Affects Versions: 0.9.0.1, 0.10.2.0, 0.11.0.0
>            Reporter: Pengwei
>
> We found the following error in the log, and the topic could not be removed for a long time:
> [2016-07-11 20:17:52,965] ERROR Controller 1328 epoch 315 initiated state change of replica 1328 for partition [websocket_test_topic,0] from OnlineReplica to ReplicaDeletionStarted failed (state.change.logger)
> java.lang.AssertionError: assertion failed: Replica [Topic=websocket_test_topic,Partition=0,Replica=1328] should be in the OfflineReplica states before moving to ReplicaDeletionStarted state. Instead it is in OnlineReplica state
>         at scala.Predef$.assert(Predef.scala:165)
>         at kafka.controller.ReplicaStateMachine.assertValidPreviousStates(ReplicaStateMachine.scala:309)
>         at kafka.controller.ReplicaStateMachine.handleStateChange(ReplicaStateMachine.scala:190)
>         at kafka.controller.ReplicaStateMachine$$anonfun$handleStateChanges$2.apply(ReplicaStateMachine.scala:114)
>         at kafka.controller.ReplicaStateMachine$$anonfun$handleStateChanges$2.apply(ReplicaStateMachine.scala:114)
>         at scala.collection.immutable.HashSet$HashSet1.foreach(HashSet.scala:322)
>         at kafka.controller.ReplicaStateMachine.handleStateChanges(ReplicaStateMachine.scala:114)
>         at kafka.controller.TopicDeletionManager$$anonfun$startReplicaDeletion$2.apply(TopicDeletionManager.scala:344)
>         at kafka.controller.TopicDeletionManager$$anonfun$startReplicaDeletion$2.apply(TopicDeletionManager.scala:334)
>         at scala.collection.immutable.Map$Map1.foreach(Map.scala:116)
>         at kafka.controller.TopicDeletionManager.startReplicaDeletion(TopicDeletionManager.scala:334)
>         at kafka.controller.TopicDeletionManager.kafka$controller$TopicDeletionManager$$onPartitionDeletion(TopicDeletionManager.scala:367)
>         at kafka.controller.TopicDeletionManager$$anonfun$kafka$controller$TopicDeletionManager$$onTopicDeletion$2.apply(TopicDeletionManager.scala:313)
>         at kafka.controller.TopicDeletionManager$$anonfun$kafka$controller$TopicDeletionManager$$onTopicDeletion$2.apply(TopicDeletionManager.scala:312)
>         at scala.collection.immutable.Set$Set1.foreach(Set.scala:79)
>         at kafka.controller.TopicDeletionManager.kafka$controller$TopicDeletionManager$$onTopicDeletion(TopicDeletionManager.scala:312)
>         at kafka.controller.TopicDeletionManager$DeleteTopicsThread$$anonfun$doWork$1$$anonfun$apply$mcV$sp$4.apply(TopicDeletionManager.scala:431)
>         at kafka.controller.TopicDeletionManager$DeleteTopicsThread$$anonfun$doWork$1$$anonfun$apply$mcV$sp$4.apply(TopicDeletionManager.scala:403)
>         at scala.collection.immutable.Set$Set1.foreach(Set.scala:79)
>         at kafka.controller.TopicDeletionManager$DeleteTopicsThread$$anonfun$doWork$1.apply$mcV$sp(TopicDeletionManager.scala:403)
>         at kafka.controller.TopicDeletionManager$DeleteTopicsThread$$anonfun$doWork$1.apply(TopicDeletionManager.scala:397)
>         at kafka.controller.TopicDeletionManager$DeleteTopicsThread$$anonfun$doWork$1.apply(TopicDeletionManager.scala:397)
>         at kafka.utils.CoreUtils$.inLock(CoreUtils.scala:262)
>         at kafka.controller.TopicDeletionManager$DeleteTopicsThread.doWork(TopicDeletionManager.scala:397)
>         at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:63)

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)