Return-Path:
Delivered-To: apmail-incubator-cassandra-commits-archive@minotaur.apache.org
Received: (qmail 97556 invoked from network); 16 Mar 2009 22:00:11 -0000
Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2)
    by minotaur.apache.org with SMTP; 16 Mar 2009 22:00:11 -0000
Received: (qmail 8611 invoked by uid 500); 16 Mar 2009 22:00:11 -0000
Delivered-To: apmail-incubator-cassandra-commits-archive@incubator.apache.org
Received: (qmail 8589 invoked by uid 500); 16 Mar 2009 22:00:11 -0000
Mailing-List: contact cassandra-commits-help@incubator.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: cassandra-dev@incubator.apache.org
Delivered-To: mailing list cassandra-commits@incubator.apache.org
Received: (qmail 8578 invoked by uid 99); 16 Mar 2009 22:00:11 -0000
Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136)
    by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 Mar 2009 15:00:11 -0700
X-ASF-Spam-Status: No, hits=-2000.0 required=10.0
    tests=ALL_TRUSTED
X-Spam-Check-By: apache.org
Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140)
    by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 Mar 2009 22:00:10 +0000
Received: from brutus (localhost [127.0.0.1])
    by brutus.apache.org (Postfix) with ESMTP id 92A67234C046
    for ; Mon, 16 Mar 2009 14:59:50 -0700 (PDT)
Message-ID: <2029868243.1237240790599.JavaMail.jira@brutus>
Date: Mon, 16 Mar 2009 14:59:50 -0700 (PDT)
From: "Neophytos Demetriou (JIRA)"
To: cassandra-commits@incubator.apache.org
Subject: [jira] Commented: (CASSANDRA-7) Cassandra silently loses data when a single row gets large
In-Reply-To: <1248463770.1237232510541.JavaMail.jira@brutus>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394
X-Virus-Checked: Checked by ClamAV on apache.org

[
https://issues.apache.org/jira/browse/CASSANDRA-7?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12682482#action_12682482 ]

Neophytos Demetriou commented on CASSANDRA-7:
---------------------------------------------

(a) It happens when you insert a large number of columns in a single row.
(b) Cassandra silently loses some of these inserts (batch inserts are also inserts).
(c) This DOES happen when the threshold is violated (the cumulative size is only one of the reasons for the threshold to be violated).
(d) It also happens while flushing the memtable to disk.

Yes, I can open a new ticket, but it seemed relevant to this issue.

> Cassandra silently loses data when a single row gets large
> ----------------------------------------------------------
>
>                 Key: CASSANDRA-7
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7
>             Project: Cassandra
>          Issue Type: Bug
>         Environment: code in trunk, Red Hat 4.1.2-33, Linux version 2.6.23.1-42.fc8, java version "1.7.0-nio2"
>            Reporter: Sandeep Tata
>            Priority: Critical
>         Attachments: BigReadWriteTest.java, dirty_bit_patch.txt, dirty_bit_patch_v2.txt
>
>
> When you insert a large number of columns in a single row, Cassandra silently loses some of these inserts.
> This does not happen until the cumulative size of the columns in a single row exceeds several megabytes.
> Say each value is 1 MB:
> insert("row", "col0", value, timestamp)
> insert("row", "col1", value, timestamp)
> insert("row", "col2", value, timestamp)
> ...
> ...
> insert("row", "col100", value, timestamp)
> Running:
> get_column("row", "col0")
> get_column("row", "col1")
> ...
> ..
> get_column("row", "col100")
> The sequence of get_column calls will fail at some point before 100. This was also a problem with the old code on code.google.
> I will attach a small program that will help you reproduce this.
> 1. This only happens when the cumulative size of the row exceeds several megabytes.
> 2. In fact, the single row has to be large enough to trigger an SSTable flush for this error to occur.
> 3. No OutOfMemory errors are thrown, and there is nothing relevant in the logs.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
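
For readers without the attached BigReadWriteTest.java, the insert-then-read check described in the issue can be sketched roughly as follows. This is not the attached program and not the Cassandra client API: the ColumnStore interface, the InMemoryStore stand-in, and first_missing_column are hypothetical names introduced only for illustration, and a real run would need an adapter that forwards these calls to an actual cluster.

```python
from typing import Dict, Optional, Protocol, Tuple


class ColumnStore(Protocol):
    """Hypothetical interface mirroring the pseudocode calls in the report."""

    def insert(self, row: str, column: str, value: bytes, timestamp: int) -> None: ...

    def get_column(self, row: str, column: str) -> Optional[bytes]: ...


class InMemoryStore:
    """Stand-in backend so the sketch runs without a cluster. It never loses data."""

    def __init__(self) -> None:
        self._cells: Dict[Tuple[str, str], Tuple[int, bytes]] = {}

    def insert(self, row: str, column: str, value: bytes, timestamp: int) -> None:
        current = self._cells.get((row, column))
        # Keep the write unless an existing cell has a newer timestamp.
        if current is None or timestamp >= current[0]:
            self._cells[(row, column)] = (timestamp, value)

    def get_column(self, row: str, column: str) -> Optional[bytes]:
        cell = self._cells.get((row, column))
        return None if cell is None else cell[1]


def first_missing_column(
    store: ColumnStore,
    row: str = "row",
    last_index: int = 100,
    value_size: int = 1024 * 1024,
    timestamp: int = 1,
) -> Optional[str]:
    """Write col0..col<last_index> into one row, then read each column back.

    Returns the name of the first column that is absent or differs from what
    was written, or None if every column round-trips.
    """
    value = b"x" * value_size
    for i in range(last_index + 1):
        store.insert(row, f"col{i}", value, timestamp)
    for i in range(last_index + 1):
        if store.get_column(row, f"col{i}") != value:
            return f"col{i}"
    return None
```

With the defaults this writes 101 values of 1 MB each (col0 through col100, as in the report). A return value of None means every column was read back; a column name marks the first point where the read sequence fails.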