Date: Thu, 22 Aug 2013 12:19:10 -0300
Subject: node dead after restart
From: Marcelo Elias Del Valle
To: user@cassandra.apache.org

Hello,

I am having a problem with a node in a test environment I have at Amazon. I am using Cassandra 1.2.3 in Amazon EC2. Here is my nodetool ring output:

$ nodetool ring
Note: Ownership information does not include topology; for complete information, specify a keyspace

Datacenter: us-east
==========
Address     Rack  Status  State   Load      Owns    Token
                                                    113427455640312821154458202479064646084
10.0.0.76   1b    Up      Normal  31.34 MB  33.33%  1808575600
10.0.0.146  1b    Up      Normal  34.24 MB  33.33%  56713727820156410577229101240436610842
10.0.0.111  1b    Down    Normal  21.19 MB  33.33%  113427455640312821154458202479064646084

I logged in to the 10.0.0.111 machine and restarted Cassandra while watching the log. The gossip protocol is still up, but the node starts and goes down right afterwards. Here is what I see in the logs:

sudo tail /var/log/cassandra/output.log
 INFO 12:16:23,084 Node /10.0.0.111 has restarted, now UP
 INFO 12:16:23,095 InetAddress /10.0.0.111 is now UP
 INFO 12:16:23,097 Node /10.0.0.111 state jump to normal
 INFO 12:16:23,105 Not starting native transport as requested. Use JMX (StorageService->startNativeTransport()) to start it
 INFO 12:16:23,108 Binding thrift service to ip-10-0-0-146.ec2.internal/10.0.0.146:9160
 INFO 12:16:23,137 Using TFramedTransport with a max frame size of 15728640 bytes.
 INFO 12:16:23,143 Using synchronous/threadpool thrift server on ip-10-0-0-146.ec2.internal : 9160
 INFO 12:16:23,143 Listening for thrift clients...
 INFO 12:16:30,063 Saved local counter id: 76c1a930-a866-11e2-a3bd-831b111cd74c
 INFO 12:16:32,860 InetAddress /10.0.0.111 is now dead.

I have no clue what is wrong. Any hints on what I could do to track down the problem?

Best regards,
--
Marcelo Elias Del Valle
http://mvalle.com - @mvallebr