Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 42638 invoked from network); 16 May 2010 05:19:05 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 16 May 2010 05:19:05 -0000 Received: (qmail 1322 invoked by uid 500); 16 May 2010 05:19:05 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 1230 invoked by uid 500); 16 May 2010 05:19:05 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 1222 invoked by uid 99); 16 May 2010 05:19:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 16 May 2010 05:19:04 +0000 X-ASF-Spam-Status: No, hits=-1431.8 required=10.0 tests=ALL_TRUSTED,AWL X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 16 May 2010 05:19:03 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o4G5IgHR019600 for ; Sun, 16 May 2010 05:18:43 GMT Message-ID: <11916893.69871273987122889.JavaMail.jira@thor> Date: Sun, 16 May 2010 01:18:42 -0400 (EDT) From: "Toby Jungen (JIRA)" To: commits@cassandra.apache.org Subject: [jira] Commented: (CASSANDRA-1093) BinaryMemtable interface silently dropping data. In-Reply-To: <24966965.49781273868082498.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-1093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12867953#action_12867953 ] Toby Jungen commented on CASSANDRA-1093: ---------------------------------------- Yes, I'm flushing each node after the import. I've also tried flushing the system keyspace (no effect). As noted in the readme, I would not be surprised if this problem is unique to my hardware/software configuration and isn't an inherent problem with Cassandra's BMT interface. For what it's worth, I've hacked together a "workaround" for this problem by writing SSTables directly (using o.a.c.io.SSTableWriter), copying the generated files to appropriate directories on the nodes, and then restarting the nodes. This solution is bound to result in other bugs, but for now I've verified that there is no lost data with this method. > BinaryMemtable interface silently dropping data. > ------------------------------------------------ > > Key: CASSANDRA-1093 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1093 > Project: Cassandra > Issue Type: Bug > Components: Core > Environment: Linux Centos5, Fedora Core 4. Java HotSpot Server 1.6.0_14. See readme for more details. > Reporter: Toby Jungen > Fix For: 0.6.2 > > Attachments: cassandra_bmt_test.tar.gz > > > I've been attempting to use the Binary Memtable (BMT) interface to load a large number of rows. During my testing, I discovered that on larger loads (~1 million rows), occasionally some of the data never appears in the database. This happens in a non-deterministic manner, as sometimes all the data loads fine, and other times a significant chunk goes missing. No errors are ever logged to indicate a problem. I'm attaching some sample code that approximates my application's usage of Cassandra and explains this bug in more detail. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.