Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7042F4BB7 for ; Sat, 9 Jul 2011 05:38:20 +0000 (UTC) Received: (qmail 28970 invoked by uid 500); 9 Jul 2011 05:38:16 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 28697 invoked by uid 500); 9 Jul 2011 05:38:04 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 28686 invoked by uid 99); 9 Jul 2011 05:38:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 09 Jul 2011 05:38:00 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of izquierdo@strands.com designates 217.116.18.226 as permitted sender) Received: from [217.116.18.226] (HELO mail.strands.com) (217.116.18.226) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 09 Jul 2011 05:37:53 +0000 Received: from localhost (localhost [127.0.0.1]) by mail.strands.com (Postfix) with ESMTP id 5E0C63047D2 for ; Sat, 9 Jul 2011 07:37:32 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at strands.com X-Spam-Score: -9.5 X-Spam-Level: Received: from mail.strands.com ([127.0.0.1]) by localhost (mail.strands.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id jQztV3O56Jl6 for ; Sat, 9 Jul 2011 07:37:27 +0200 (CEST) Received: from [192.168.1.141] (41.67.217.87.dynamic.jazztel.es [87.217.67.41]) (using SSLv3 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: izquierdo) by mail.strands.com (Postfix) with ESMTPSA id 5EA7C3047CB for ; Sat, 9 Jul 2011 07:37:27 +0200 (CEST) Subject: Re: Corrupted data From: =?ISO-8859-1?Q?H=E9ctor?= Izquierdo Seliva To: user@cassandra.apache.org In-Reply-To: <137920FE-1CF4-42E9-950E-6B7544B0662D@thelastpickle.com> References: <1310143134.5666.1.camel@Avalon> <137920FE-1CF4-42E9-950E-6B7544B0662D@thelastpickle.com> Content-Type: text/plain; charset="UTF-8" Date: Sat, 09 Jul 2011 07:37:25 +0200 Message-ID: <1310189845.1935.1.camel@Avalon> Mime-Version: 1.0 X-Mailer: Evolution 2.32.2 Content-Transfer-Encoding: 8bit X-Old-Spam-Flag: NO X-Old-Spam-Status: No, score=-9.5 required=5.1 tests=[ALL_TRUSTED=-5.5, BAYES_00=-4] autolearn=ham Hi Aaron, El vie, 08-07-2011 a las 14:47 -0700, aaron morton escribió: > You may not lose data. > > - What version and whats the upgrade history? all versions from 0.7.1 to 0.8.1. All cfs were in 0.8.1 format though > - What RF / node count / CL ? RF=3, node count = 6 > - Have you been running repair consistently ? Nop, only when something breaks > - Is this on a single node or all nodes ? A couple of nodes. Scrub told there were a few thousand of columns it could not restore. > > Cheers > > ----------------- > Aaron Morton > Freelance Cassandra Developer > @aaronmorton > http://www.thelastpickle.com > > On 8 Jul 2011, at 09:38, Héctor Izquierdo Seliva wrote: > > > Hi everyone, > > > > I'm having thousands of these errors: > > > > WARN [CompactionExecutor:1] 2011-07-08 16:36:45,705 > > CompactionManager.java (line 737) Non-fatal error reading row > > (stacktrace follows) > > java.io.IOError: java.io.IOException: Impossible row size > > 6292724931198053 > > at > > org.apache.cassandra.db.compaction.CompactionManager.scrubOne(CompactionManager.java:719) > > at > > org.apache.cassandra.db.compaction.CompactionManager.doScrub(CompactionManager.java:633) > > at org.apache.cassandra.db.compaction.CompactionManager.access > > $600(CompactionManager.java:65) > > at org.apache.cassandra.db.compaction.CompactionManager > > $3.call(CompactionManager.java:250) > > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > > at java.util.concurrent.ThreadPoolExecutor > > $Worker.runTask(ThreadPoolExecutor.java:886) > > at java.util.concurrent.ThreadPoolExecutor > > $Worker.run(ThreadPoolExecutor.java:908) > > at java.lang.Thread.run(Thread.java:662) > > Caused by: java.io.IOException: Impossible row size 6292724931198053 > > ... 9 more > > INFO [CompactionExecutor:1] 2011-07-08 16:36:45,705 > > CompactionManager.java (line 743) Retrying from row index; data is -8 > > bytes starting at 4735525245 > > WARN [CompactionExecutor:1] 2011-07-08 16:36:45,705 > > CompactionManager.java (line 767) Retry failed too. Skipping to next > > row (retry's stacktrace follows) > > java.io.IOError: java.io.EOFException: bloom filter claims to be > > 863794556 bytes, longer than entire row size -8 > > > > > > THis is during scrub, as I saw similar errors while in normal operation. > > Is there anything I can do? It looks like I'm going to lose a ton of > > data > > >