Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7DFF67304 for ; Tue, 13 Sep 2011 00:22:13 +0000 (UTC) Received: (qmail 26126 invoked by uid 500); 13 Sep 2011 00:22:11 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 26102 invoked by uid 500); 13 Sep 2011 00:22:11 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 26094 invoked by uid 99); 13 Sep 2011 00:22:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Sep 2011 00:22:11 +0000 X-ASF-Spam-Status: No, hits=1.8 required=5.0 tests=FREEMAIL_FROM,FREEMAIL_REPLY,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of sdolgy@gmail.com designates 209.85.215.43 as permitted sender) Received: from [209.85.215.43] (HELO mail-ew0-f43.google.com) (209.85.215.43) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Sep 2011 00:22:06 +0000 Received: by ewy20 with SMTP id 20so1906ewy.30 for ; Mon, 12 Sep 2011 17:21:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=7mH6HuFmWn/Rr5C5qjjhdXK3X9VB4PH1ryRZUoVBr/I=; b=bl6K2dU5Usg1+4HFUoivX0BnuGi7Kd9rUrsf6h+rZFDN3Y3R6wQJLQk/Dd5F5Hzx2z K15kVvv0lHWop/5w61Nv5CdD+oNwpvWgmYzicLz9VrCy8f+8+l3AWYst4CwtSqgVGRxb AKyapD/j/rveZQJBUr757cmP7vYaKtu2k+V6w= Received: by 10.14.0.204 with SMTP id 52mr468515eeb.190.1315873304093; Mon, 12 Sep 2011 17:21:44 -0700 (PDT) MIME-Version: 1.0 Received: by 10.14.189.1 with HTTP; Mon, 12 Sep 2011 17:21:24 -0700 (PDT) In-Reply-To: References: From: Sasha Dolgy Date: Tue, 13 Sep 2011 02:21:24 +0200 Message-ID: Subject: Re: AntiEntropyService.getNeighbors pulls information from where? To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable use system; del LocationInfo[52696e67]; i ran this on the nodes that had the problems. stopped, started the nodes, it re-did it's job .... job done. all fixed with a new bug! https://issues.apache.org/jira/browse/CASSANDRA-3186 On Tue, Sep 13, 2011 at 2:09 AM, aaron morton wro= te: > I'm pretty sure I'm behind on how to deal with this problem. > > Best I know is to start the node with "-Dcassandra.load_ring_state=3Dfals= e" as a JVM option. But if the ghost IP address is in gossip it will not wo= rk, and it should be in gossip. > > Does the ghost IP show up in nodetool ring ? > > Anyone know a way to remove a ghost IP from gossip that does not have a t= oken associated with it ? > > Cheers > > ----------------- > Aaron Morton > Freelance Cassandra Developer > @aaronmorton > http://www.thelastpickle.com > > On 13/09/2011, at 6:39 AM, Sasha Dolgy wrote: > >> This relates to the issue i opened the other day: >> https://issues.apache.org/jira/browse/CASSANDRA-3175 .. =A0basically, >> 'nodetool ring' throws an exception on two of the four nodes. >> >> In my fancy little world, the problems appear to be related to one of >> the nodes thinking that someone is their neighbor ... and that someone >> moved away a long time ago............ >> >> /mnt/cassandra/logs/system.log: INFO [AntiEntropySessions:5] >> 2011-09-10 21:20:02,182 AntiEntropyService.java (line 658) Could not >> proceed on repair because a neighbor (/10.130.185.136) is dead: >> manual-repair-d8cdb59a-04a4-4596-b73f-cba3bd2b9eab failed. >> /mnt/cassandra/logs/system.log: INFO [AntiEntropySessions:7] >> 2011-09-11 21:20:02,258 AntiEntropyService.java (line 658) Could not >> proceed on repair because a neighbor (/10.130.185.136) is dead: >> manual-repair-ad17e938-f474-469c-9180-d88a9007b6b9 failed. >> /mnt/cassandra/logs/system.log: INFO [AntiEntropySessions:9] >> 2011-09-12 21:20:02,256 AntiEntropyService.java (line 658) Could not >> proceed on repair because a neighbor (/10.130.185.136) is dead: >> manual-repair-636150a5-4f0e-45b7-b400-24d8471a1c88 failed. >> >> Appears only in the logs for one node that is generating the issue. 172.= 16.12.10 >> >> Where do I find where the AntiEntropyService.getNeighbors(tablename, >> range) is pulling it's information from? >> >> On the two nodes that work: >> >> [default@system] describe cluster; >> Cluster Information: >> Snitch: org.apache.cassandra.locator.Ec2Snitch >> Partitioner: org.apache.cassandra.dht.RandomPartitioner >> Schema versions: >> 1b871300-dbdc-11e0-0000-564008fe649f: [172.16.12.10, 172.16.12.11, >> 172.16.14.12, 172.16.14.10] >> [default@system] >> >> From the two nodes that don't work: >> >> [default@unknown] describe cluster; >> Cluster Information: >> Snitch: org.apache.cassandra.locator.Ec2Snitch >> Partitioner: org.apache.cassandra.dht.RandomPartitioner >> Schema versions: >> 1b871300-dbdc-11e0-0000-564008fe649f: [172.16.12.10, 172.16.12.11, >> 172.16.14.12, 172.16.14.10] >> UNREACHABLE: [10.130.185.136] --> which is really 172.16.14.10 >> [default@unknown] >> >> Really now. =A0Where does 10.130.185.136 exist? =A0It's in none of the >> configurations I have AND the full ring has been shut down and started >> up ... not trying to give Vijay a hard time by posting here btw! >> >> Just thinking it could be something super silly ... that a wider >> audience has come across. >> >> -- >> Sasha Dolgy >> sasha.dolgy@gmail.com > > --=20 Sasha Dolgy sasha.dolgy@gmail.com