Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A82CD200D15 for ; Thu, 5 Oct 2017 17:36:04 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A67671609DA; Thu, 5 Oct 2017 15:36:04 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id EC02D1609E1 for ; Thu, 5 Oct 2017 17:36:03 +0200 (CEST) Received: (qmail 95296 invoked by uid 500); 5 Oct 2017 15:36:02 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 95286 invoked by uid 99); 5 Oct 2017 15:36:02 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 Oct 2017 15:36:02 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id E85B11808F1 for ; Thu, 5 Oct 2017 15:36:01 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id blnTRVJwma_2 for ; Thu, 5 Oct 2017 15:36:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 0D9C95FD66 for ; Thu, 5 Oct 2017 15:36:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 9E1B2E0536 for ; Thu, 5 Oct 2017 15:36:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 5C6672431D for ; Thu, 5 Oct 2017 15:36:00 +0000 (UTC) Date: Thu, 5 Oct 2017 15:36:00 +0000 (UTC) From: "Mark Miller (JIRA)" To: dev@lucene.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (SOLR-11435) Replicas failed to be deleted can overwrite replicas of recreated collections MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 05 Oct 2017 15:36:04 -0000 [ https://issues.apache.org/jira/browse/SOLR-11435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16193057#comment-16193057 ] Mark Miller commented on SOLR-11435: ------------------------------------ We should add a test for this for sure. bq. I think we should add it back when users are not using the autoAddReplica feature? That may be a good short term fix, but we should consider how this all works with legacyCloud=false and the new autoAddReplica replacement longer term. Another option is to just revert that change for now, even with it's affect on autoAddReplica, given the nightly test was broken some time ago and a variety of things could be broken now. Also though, I wonder if SOLR-7248 was really the right fix for this issue. Using the coreNodeName is a very simple design to understand. This complicated things. Can we not prevent this behavior in a better way? The most important thing is to have a test for this though. > Replicas failed to be deleted can overwrite replicas of recreated collections > ----------------------------------------------------------------------------- > > Key: SOLR-11435 > URL: https://issues.apache.org/jira/browse/SOLR-11435 > Project: Solr > Issue Type: Bug > Security Level: Public(Default Security Level. Issues are Public) > Affects Versions: 7.0 > Reporter: Mano Kovacs > > When a replica comes up that was deleted from ZK while it was gone, it can replace replicas in ZK if the collection an core names are equal. > Reproduction: > # Create {{collection1}} with 1 shard, 2 replicas on {{node1}} and {{node2}} > # Shut down {{node2}} > # Delete {{collection1}} > # Create {{collection1}} on {{node1}} and {{node3}} > # Start {{node2}} > Expected: > {{node2}} will not initialize node as it is not assigned to in ZK ({{legacyCloud=false}}) > Actual: > {{node2}} will overwrite the {{baseurl}} in {{state.json}} for one of the replicas as the {{coreNodeName}} and the collection name will match the core it has. > Note: SOLR-7248 introduced a {{baseurl}} comparison which was removed in SOLR-10279. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org