Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3360F696D for ; Wed, 13 Jul 2011 16:56:20 +0000 (UTC) Received: (qmail 47300 invoked by uid 500); 13 Jul 2011 16:56:17 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 46909 invoked by uid 500); 13 Jul 2011 16:56:16 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 46898 invoked by uid 99); 13 Jul 2011 16:56:16 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Jul 2011 16:56:16 +0000 X-ASF-Spam-Status: No, hits=1.8 required=5.0 tests=FREEMAIL_FROM,FREEMAIL_REPLY,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of sdolgy@gmail.com designates 209.85.210.172 as permitted sender) Received: from [209.85.210.172] (HELO mail-iy0-f172.google.com) (209.85.210.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Jul 2011 16:56:10 +0000 Received: by iye7 with SMTP id 7so6803269iye.31 for ; Wed, 13 Jul 2011 09:55:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=EoSrTd3EsuUjMJbtMOcCoZZc5zVr7OGtNa9sdC0xNhs=; b=KDlnsyFioSpQaTq+XctQwC6yolBJNBG0uEuQampl2p42LoSCX0xpi8PVrsvNN8CnTg SPEhjw5E+41xgAdbeYMZP+OEzwYEXNoONbg1l3axrKBeA37lIWXQ8VEhGdwtZwLzGqbR ScDCqE58kdZ8JpEd7qS9pXYNOQXGeUSW06PXk= Received: by 10.43.53.4 with SMTP id vo4mr1193363icb.395.1310576149046; Wed, 13 Jul 2011 09:55:49 -0700 (PDT) MIME-Version: 1.0 Received: by 10.42.4.6 with HTTP; Wed, 13 Jul 2011 09:55:29 -0700 (PDT) In-Reply-To: References: <71758072ACCC42D19AEBFE0380824EEA@gmail.com> From: Sasha Dolgy Date: Wed, 13 Jul 2011 18:55:29 +0200 Message-ID: Subject: Re: One node down but it thinks its fine... To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org any firewall changes? ping is fine ... but if you can't get from node(a) to nodes(n) on the specific ports... On Wed, Jul 13, 2011 at 6:47 PM, samal wrote: > Check seed ip is same in all node and should not be loopback ip on cluste= r. > > On Wed, Jul 13, 2011 at 8:40 PM, Ray Slakinski > wrote: >> >> One of our nodes, which happens to be the seed thinks its Up and all the >> other nodes are down. However all the other nodes thinks the seed is dow= n >> instead. The logs for the seed node show everything is running as it sho= uld >> be. I've tried restarting the node, turning on/off gossip and thrift and >> nothing seems to get the node to see the rest of its ring as up and runn= ing. >> I have also tried restarting one of the other nodes, which had no affect= on >> the situation. Below is the ring outputs for the seed and one other node= in >> the ring, plus a ping to show that the seed can ping the other node. >> >> # bin/nodetool -h 0.0.0.0 ring >> Address Status State Load Owns Token >> =A0141784319550391026443072753096570088105 >> 127.0.0.1 Up Normal 4.61 GB 16.67% 0 >> xx.xxx.30.210 Down Normal ? 16.67% 2835686391007820528861455061931401762= 1 >> xx.xx.90.87 Down Normal ? 16.67% 56713727820156410577229101238628035242 >> xx.xx.22.236 Down Normal ? 16.67% 85070591730234615865843651857942052863 >> xx.xx.97.96 Down Normal ? 16.67% 113427455640312821154458202477256070484 >> xx.xxx.17.122 Down Normal ? 16.67% 1417843195503910264430727530965700881= 05 >> >> >> # ping xx.xxx.30.210 >> PING xx.xxx.30.210 (xx.xxx.30.210) 56(84) bytes of data. >> 64 bytes from xx.xxx.30.210: icmp_req=3D1 ttl=3D61 time=3D0.299 ms >> 64 bytes from xx.xxx.30.210: icmp_req=3D2 ttl=3D61 time=3D0.287 ms >> ^C >> --- xx.xxx.30.210 ping statistics --- >> 2 packets transmitted, 2 received, 0% packet loss, time 999ms >> rtt min/avg/max/mdev =3D 0.287/0.293/0.299/0.006 ms >> >> >> # bin/nodetool -h xx.xxx.30.210 ring >> Address Status State Load Owns Token >> =A0141784319550391026443072753096570088105 >> xx.xxx.23.40 Down Normal ? 16.67% 0 >> xx.xxx.30.210 Up Normal 10.58 GB 16.67% >> 28356863910078205288614550619314017621 >> xx.xx.90.87 Up Normal 10.47 GB 16.67% >> 56713727820156410577229101238628035242 >> xx.xx.22.236 Up Normal 9.63 GB 16.67% >> 85070591730234615865843651857942052863 >> xx.xx.97.96 Up Normal 10.68 GB 16.67% >> 113427455640312821154458202477256070484 >> xx.xxx.17.122 Up Normal 10.18 GB 16.67% >> 141784319550391026443072753096570088105 >> >> -- >> Ray Slakinski >> >> > > --=20 Sasha Dolgy sasha.dolgy@gmail.com