Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm
Reply-To: dev@cassandra.apache.org
Message-ID: <4168857.175801288629566030.JavaMail.jira@thor>
Date: Mon, 1 Nov 2010 12:39:26 -0400 (EDT)
From: "Gary Dusbabek (JIRA)"
To: commits@cassandra.apache.org
Subject: [jira] Commented: (CASSANDRA-1670) cannot move a node
In-Reply-To: <15052143.108591288218501507.JavaMail.jira@thor>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


    [ https://issues.apache.org/jira/browse/CASSANDRA-1670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12927034#action_12927034 ]

Gary Dusbabek commented on CASSANDRA-1670:
------------------------------------------

When a node is decommissioned, it gets added to justRemovedEndpoints_, but removed from endpointStateMap_. The old code will only remove a node from justRemovedEndpoints_ if it currently exists in endpointStateMap_. If the node stays in justRemovedEndpoints_ (which it will currently), it can never be recognized as part of the ring because of the check in Gossiper.handleNewJoin().

> cannot move a node
> ------------------
>
>                 Key: CASSANDRA-1670
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1670
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6.6
>         Environment: RAX
>            Reporter: Matthew F. Dennis
>            Assignee: Gary Dusbabek
>             Fix For: 0.6.7, 0.7.0
>
>         Attachments: 1670-0.6.txt, v1-0001-code-that-tidied-Gossiper.justRemovedEndpoints_-was-no.txt
>
>
> two node cluster (node0, node1). node0 is listed as the only seed on both nodes. Listen addresses explicitly set to an IP on both nodes. No initial token, no autobootstrap (but see below). Bring up the ring. Everything is fine on both nodes.
> decom node1. verify decom completed correctly by reading the logs on both nodes. rm all data/logs on node1. bring node1 up again.
> One of two things happen:
> * node0 thinks it is in a ring by itself, node1 thinks both nodes are in the ring.
> * both node0 and node1 think they are in rings by themselves
> If you restart node0 after decom, it appears to work normally.
> Similar issues seem to present if you kill node1 (either when autobootstrapping before it completes or after it is in the ring) and removetoken.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
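
The failure mode described in the comment above can be sketched in isolation. The following is a minimal, hypothetical model and not the actual Gossiper code: only the names justRemovedEndpoints_, endpointStateMap_, and handleNewJoin come from the comment. The class name GossipSketch, the statusCheck and decommission methods, the requireLiveState flag, and the expiry delay are assumptions made purely for illustration.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Iterator;
import java.util.Map;
import java.util.Set;

// Hypothetical, simplified model of the bookkeeping described in the comment.
public class GossipSketch {
    // Endpoints that were recently removed, mapped to the time of removal.
    final Map<String, Long> justRemovedEndpoints_ = new HashMap<>();
    // Endpoints currently known to this node.
    final Set<String> endpointStateMap_ = new HashSet<>();
    // true models the old behavior: tidy only endpoints still in endpointStateMap_.
    final boolean requireLiveState;
    // Assumed expiry delay; the real value is not given in the comment.
    static final long QUARANTINE_MS = 1000;

    GossipSketch(boolean requireLiveState) {
        this.requireLiveState = requireLiveState;
    }

    // Decommission: dropped from the state map, remembered as just removed.
    void decommission(String ep, long now) {
        endpointStateMap_.remove(ep);
        justRemovedEndpoints_.put(ep, now);
    }

    // Periodic tidy-up of justRemovedEndpoints_.
    void statusCheck(long now) {
        Iterator<Map.Entry<String, Long>> it = justRemovedEndpoints_.entrySet().iterator();
        while (it.hasNext()) {
            Map.Entry<String, Long> e = it.next();
            boolean expired = now - e.getValue() > QUARANTINE_MS;
            boolean known = endpointStateMap_.contains(e.getKey());
            // Old behavior: a decommissioned endpoint is never "known", so it is never tidied.
            if (expired && (!requireLiveState || known)) {
                it.remove();
            }
        }
    }

    // A join is ignored while the endpoint is still listed as just removed.
    boolean handleNewJoin(String ep) {
        if (justRemovedEndpoints_.containsKey(ep)) {
            return false;
        }
        endpointStateMap_.add(ep);
        return true;
    }

    // Join, decommission, wait past the delay, then try to rejoin.
    static boolean rejoinSucceeds(boolean oldBehavior) {
        GossipSketch g = new GossipSketch(oldBehavior);
        g.handleNewJoin("node1");
        g.decommission("node1", 0);
        g.statusCheck(5000);
        return g.handleNewJoin("node1");
    }

    public static void main(String[] args) {
        System.out.println("old behavior rejoin:   " + rejoinSucceeds(true));
        System.out.println("fixed behavior rejoin: " + rejoinSucceeds(false));
    }
}
```

Under these assumptions, the old condition leaves node1 in justRemovedEndpoints_ indefinitely, so its later join is ignored; dropping the dependency on endpointStateMap_ lets the entry expire and the rejoin succeed.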