Date: Thu, 14 Mar 2013 18:20:14 -0700
Subject: Re: repair hangs
From: Dane Miller
To: user@cassandra.apache.org

On Thu, Mar 14, 2013 at 6:34 AM, aaron morton wrote:
>> 1. is this a nodetool bug? is there any way to propagate the
>> java.io.IOException back to nodetool?
> The repair continues to work even if nodetool fails, it's a server side thing.
>
>> 2. network problems on EC2, I'm shocked! are there recommended
>> network settings for EC2?
> Streaming does not put a timeout on the socket, in this case check the 10.82.233.59
> node to see why the pipe broke.

Good catch.
There are indeed errors on node 10.82.233.59:

cassandra3:/var/log/cassandra# grep -A1 ERROR system.log
ERROR [Thread-5571] 2013-03-11 23:38:12,554 CassandraDaemon.java (line 132) Exception in thread Thread[Thread-5571,5,main]
java.lang.UnsupportedOperationException: SSTable /raid0/cassandra/data/OpsCenter/events_timeline/OpsCenter-events_timeline-hf-1-Data.db is not compatible with current version ib
--
ERROR [Thread-5572] 2013-03-11 23:38:12,692 CassandraDaemon.java (line 132) Exception in thread Thread[Thread-5572,5,main]
java.lang.UnsupportedOperationException: SSTable /raid0/cassandra/data/OpsCenter/rollups60/OpsCenter-rollups60-hf-766-Data.db is not compatible with current version ib
--
ERROR [Thread-5612] 2013-03-11 23:39:55,931 CassandraDaemon.java (line 132) Exception in thread Thread[Thread-5612,5,main]
java.lang.UnsupportedOperationException: SSTable /raid0/cassandra/data/OpsCenter/rollups300/OpsCenter-rollups300-hf-506-Data.db is not compatible with current version ib
--

This attempt to create a new cluster from backups of the original cluster has not been going well. These errors lead me to believe I need to focus on repairing the original cluster first.

Dane
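
P.S. For anyone reading the errors above: the SSTable file names in the log appear to follow a <keyspace>-<column family>-<version>-<generation>-Data.db pattern, so the on-disk version (hf) can be pulled out and compared with the version the node reports as current (ib). A minimal Python sketch for summarizing such lines, assuming that naming pattern and the exact log wording shown above. The function name and regex are hypothetical helpers, not part of Cassandra or nodetool:

```python
import re
from collections import defaultdict

# Assumed pattern, inferred only from the log lines quoted in this message:
#   SSTable <dir>/<keyspace>-<column_family>-<version>-<generation>-Data.db
#   is not compatible with current version <current>
INCOMPATIBLE = re.compile(
    r"SSTable \S*/(?P<keyspace>[^/-]+)-(?P<cf>[^/-]+)"
    r"-(?P<version>[a-z]+)-(?P<generation>\d+)-Data\.db"
    r" is not compatible with current version (?P<current>[a-z]+)"
)


def incompatible_sstables(log_text):
    """Group incompatible SSTables by (keyspace, column family).

    Each value is a list of (on_disk_version, generation, current_version).
    """
    found = defaultdict(list)
    for m in INCOMPATIBLE.finditer(log_text):
        found[(m["keyspace"], m["cf"])].append(
            (m["version"], int(m["generation"]), m["current"])
        )
    return dict(found)


# Two of the exception lines from the grep output above.
sample = (
    "java.lang.UnsupportedOperationException: SSTable "
    "/raid0/cassandra/data/OpsCenter/events_timeline/"
    "OpsCenter-events_timeline-hf-1-Data.db "
    "is not compatible with current version ib\n"
    "java.lang.UnsupportedOperationException: SSTable "
    "/raid0/cassandra/data/OpsCenter/rollups60/"
    "OpsCenter-rollups60-hf-766-Data.db "
    "is not compatible with current version ib\n"
)

result = incompatible_sstables(sample)
for (keyspace, cf), entries in sorted(result.items()):
    for version, generation, current in entries:
        print(f"{keyspace}.{cf}: generation {generation} is {version}, node expects {current}")
```

In practice the whole system.log could be read and passed to incompatible_sstables() to see which column families still hold old-format files before retrying the repair.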