From: JKnight JKnight
To: user@cassandra.apache.org
Subject: Re: Cassandra data file corrupt
Date: Wed, 31 Mar 2010 23:22:32 -0400

Dear Jeremy Dunck,

I tried to compact and got an error:

Caused by: java.io.UTFDataFormatException: malformed input around byte 13
    at java.io.DataInputStream.readUTF(DataInputStream.java:617)
    at java.io.RandomAccessFile.readUTF(RandomAccessFile.java:887)
    at org.apache.cassandra.io.IteratingRow.<init>(IteratingRow.java:48)
    at org.apache.cassandra.io.SSTableScanner$KeyScanningIterator.next(SSTableScanner.java:115)

I also tried version 0.6, but the same error still occurs.

Perhaps I will try the approach David Timothy suggested.

@Stu Hood: Have you implemented the code for issue 808?

Thanks a lot for the support.

On Wed, Mar 31, 2010 at 11:02 AM, Jeremy Dunck <jdunck@gmail.com> wrote:
> On Wed, Mar 31, 2010 at 7:55 AM, Stu Hood <stu.hood@rackspace.com> wrote:
> > Eventually the new file format will make it in with #674, and we'll be
> > able to implement an option to skip corrupted data:
> >
> > https://issues.apache.org/jira/browse/CASSANDRA-808
>
> That ticket seems to indicate that compaction will remove the corrupt
> data, and might make the later data readable.
>
> JKnight, did you try compacting?

--
Best regards,
JKnight
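
The stack trace in this message fails inside `readUTF`, which expects a 2-byte length followed by that many bytes of modified UTF-8. The following is a minimal sketch of how a single damaged byte can produce this kind of exception. It is not Cassandra code: the class name, the helper methods, and the key `hypothetical-row-key` are made up for illustration, and the exact exception message may differ between JDK versions.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UTFDataFormatException;

public class ReadUtfCorruptionDemo {

    /** Encodes a string as writeUTF does: 2-byte length, then modified UTF-8 bytes. */
    static byte[] encode(String key) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            out.writeUTF(key);
        }
        return buffer.toByteArray();
    }

    /** Returns the decoded string, or a short description of the failure. */
    static String tryDecode(byte[] bytes) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes))) {
            return in.readUTF();
        } catch (UTFDataFormatException e) {
            return "UTFDataFormatException: " + e.getMessage();
        } catch (IOException e) {
            return "IOException: " + e;
        }
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical key, used only for this illustration.
        byte[] good = encode("hypothetical-row-key");
        System.out.println(tryDecode(good));

        // Simulate on-disk corruption: overwrite one payload byte.
        // Offset 2 skips the length prefix; 0xFF never starts a valid
        // modified UTF-8 character, so decoding is expected to fail.
        byte[] bad = good.clone();
        bad[2 + 13] = (byte) 0xFF;
        System.out.println(tryDecode(bad));
    }
}
```

Because the length prefix is read first, corruption in the prefix itself can also make the reader consume the wrong number of bytes, which would leave every later read in the same file misaligned.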