Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EACA6200C14 for ; Tue, 7 Feb 2017 18:17:15 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id E955A160B3E; Tue, 7 Feb 2017 17:17:15 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 1AB0C160B32 for ; Tue, 7 Feb 2017 18:17:14 +0100 (CET) Received: (qmail 44973 invoked by uid 500); 7 Feb 2017 17:17:13 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 44963 invoked by uid 99); 7 Feb 2017 17:17:13 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Feb 2017 17:17:13 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 25A3D1A01CE for ; Tue, 7 Feb 2017 17:17:13 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.379 X-Spam-Level: ** X-Spam-Status: No, score=2.379 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id BufqDLVc_QcE for ; Tue, 7 Feb 2017 17:17:11 +0000 (UTC) Received: from mail-wm0-f51.google.com (mail-wm0-f51.google.com [74.125.82.51]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 436A65F283 for ; Tue, 7 Feb 2017 17:17:11 +0000 (UTC) Received: by mail-wm0-f51.google.com with SMTP id v186so20280510wmd.0 for ; Tue, 07 Feb 2017 09:17:11 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=Ash4vPBRCylOj2oYq91sk3/i09ud7oXysZxTgemsJOI=; b=hSg2VmbFaP9uSSuc1nMcPr/h05BNxUORuiL5gOAQwVEl+098EeVFfKEVoWyvQpzHz2 0uMX+Eb2sLXpF1mUdmBoSWZ47A5dxovzhYVaiH4UrmyCb6PM6d/9qae9ey8YuaNDyFEc HQcYoIsuqGMCoTgvE3XxNfPOcl6oed2096BK3NDy4dhGESo0OO5AxeIHhXsJg3xEbRH/ AANN0cNBuztyv+SJcxeQjc6MwVSotlIONfwHkNMuG7WgrIgyBcINbZXeWZjkNxKBrm9/ Jlbql2rziJrfqSgFMSDtqqyM5TVHGQ37Z1Hd/BcwAVRCQIB651ktVE4meDCZ20M9TKkw 3v2w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=Ash4vPBRCylOj2oYq91sk3/i09ud7oXysZxTgemsJOI=; b=X4Dg+lJqXpB+OihWJMptksQ4F5TxFcepKeSM0t/n7zJZIRWPZyMmKHBEksOz6kx1pn D1M6dHTRQkkiVKr0nag8/TRruQfaIm6IRBHtucSW2rVoEfsIx9xNOW2t8ezvkUcGdxQ/ s/4XkusXOM3fz+OS7FXQezoSlauudo25j3h+QhLEHfmXBJzhWCBlz8lJwDBZjxo0yGMd oA9faQJT7xCpRlDc0lKSKN7uzclz3fxZsR9ZoSsccD7RT6uIkDfcPNbLAmqx0pswrQcm JFw1fDAuJoQPI3dEuPAdn9+L0IzuEyyhX4M94UhYA5HQiddyz+zD46BnAT9Q7JHCgXmw zciA== X-Gm-Message-State: AMke39lh7wgk7cuPLjLnnrXfqICwADj2vl0wGA4Lj5Ba4jvqOUfPLqPBwtB9x6b1Y3MdMvufmykYQ84ncN1hwA== X-Received: by 10.223.130.111 with SMTP id 102mr17648600wrb.59.1486487828291; Tue, 07 Feb 2017 09:17:08 -0800 (PST) MIME-Version: 1.0 Received: by 10.80.176.5 with HTTP; Tue, 7 Feb 2017 09:16:47 -0800 (PST) In-Reply-To: References: From: Simone Franzini Date: Tue, 7 Feb 2017 11:16:47 -0600 Message-ID: Subject: Re: UnknownColumnFamilyException after removing all Cassandra data To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=001a114b2f0af7e9730547f3eaa6 archived-at: Tue, 07 Feb 2017 17:17:16 -0000 --001a114b2f0af7e9730547f3eaa6 Content-Type: text/plain; charset=UTF-8 To further add to my previous answer, the node in question is a seed node, so it did not bootstrap. Should I remove it from the list of seed nodes and then try to restart it? Simone Franzini, PhD http://www.linkedin.com/in/simonefranzini On Tue, Feb 7, 2017 at 9:43 AM, Simone Franzini wrote: > This is exactly what I did on the second node. If this is not the correct > / best procedure to adopt in these cases, please advise: > > 1. Removed all the data, including the system table (rm -rf data/ > commitlog/ saved_caches). > 2. Configured the node to replace itself, by adding the following line to > cassandra-env.sh: JVM_OPTS="$JVM_OPTS -Dcassandra.replace_address= own IP address>" > 3. Start the node. > > Noticeably, I did not do nodetool decommission or removenode. Is that the > recommended approach? > > Given what I did, I am mystified as to what the problem is. If I query the > system.schema_columnfamilies on the affected node, all CF IDs are there. > Same goes for the only other node that is currently up. Also, the other > node that is currently up has data for all those CF IDs in the data folder. > > > Simone Franzini, PhD > > http://www.linkedin.com/in/simonefranzini > > On Tue, Feb 7, 2017 at 5:39 AM, kurt greaves wrote: > >> The node is trying to communicate with another node, potentially >> streaming data, and is receiving files/data for an "unknown column family". >> That is, it doesn't know about the CF with the id >> e36415b6-95a7-368c-9ac0-ae0ac774863d. >> If you deleted some columnfamilies but not all the system keyspace and >> restarted the node I'd expect this error to occur. Or I suppose if you >> didn't decommission the node properly before blowing the data away and >> restarting. >> >> You'll have to give us more information on what your exact steps were on >> this 2nd node: >> >> When you say deleted all Cassandra data, did this include the system >> tables? Were your steps to delete all the data and then just restart the >> node? Did you remove the node from the cluster prior to deleting the data >> and restarting it (nodetool decommission/removenode? Did the node rejoin >> the cluster or did it have to bootstrap? >> >> >> > --001a114b2f0af7e9730547f3eaa6 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
To further add to my previous answer, the node in question= is a seed node, so it did not bootstrap.=C2=A0
Should I remove it from= the list of seed nodes and then try to restart it?


On Tue, Feb 7, 2017 at 9:43 AM, Simone Franz= ini <captainfranz@gmail.com> wrote:
This is exactly what I did on the second n= ode. If this is not the correct / best procedure to adopt in these cases, p= lease advise:

1. Remove= d all the data, including the system table (rm -rf data/ commitlog/ saved_c= aches).
2. Configured the= node to replace itself, by adding the following line to cassandra-env.sh: = JVM_OPTS=3D"$JVM_OPTS -Dcassandra.replace_address=3D<node own = IP address>"
3. S= tart the node.

Noticeably, I did not do nodet= ool decommission or removenode. Is that the recommended approach?

Given what I did, I am mystified as to what the probl= em is. If I query the system.schema_columnfamilies on the affected node, al= l CF IDs are there. Same goes for the only other node that is currently up.= Also, the other node that is currently up has data for all those CF IDs in= the data folder.



On Tue, Feb 7,= 2017 at 5:39 AM, kurt greaves <kurt@instaclustr.com> wro= te:
The node is try= ing to communicate with another node, potentially streaming data, and is re= ceiving files/data for an "unknown column family". That is, it do= esn't know about the CF with the id e36415b6-95a7-368c-9ac0-ae0ac7= 74863d.
If you deleted some columnfamilies but not all the sy= stem keyspace and restarted the node I'd expect this error to occur. Or= I suppose if you didn't decommission the node properly before blowing = the data away and restarting.

You'll have to give us more infor= mation on what your exact steps were on this 2nd node:

When yo= u say deleted all Cassandra data, did this include the system tables? Were = your steps to delete all the data and then just restart the node? Did you r= emove the node from the cluster prior to deleting the data and restarting i= t (nodetool decommission/removenode? Did the node rejoin the cluster or did= it have to bootstrap?




--001a114b2f0af7e9730547f3eaa6--