Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CCE277EF3 for ; Sat, 8 Oct 2011 21:30:43 +0000 (UTC) Received: (qmail 56356 invoked by uid 500); 8 Oct 2011 21:30:41 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 56329 invoked by uid 500); 8 Oct 2011 21:30:41 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 56321 invoked by uid 99); 8 Oct 2011 21:30:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 08 Oct 2011 21:30:41 +0000 X-ASF-Spam-Status: No, hits=1.4 required=5.0 tests=SPF_PASS,TO_NO_BRKTS_DIRECT X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [217.115.141.165] (HELO mail2.tricnet.de) (217.115.141.165) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 08 Oct 2011 21:30:35 +0000 X-Received: from [192.168.178.22] (91-65-198-185-dynip.superkabel.de [91.65.198.185]) by mail2.tricnet.de (Postfix) with ESMTP id 8E5DB1BF328 for ; Sat, 8 Oct 2011 23:30:13 +0200 (CEST) Message-ID: <4E90C0E5.9050002@tricnet.de> Date: Sat, 08 Oct 2011 23:30:13 +0200 From: Thomas Richter User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.21) Gecko/20110831 Lightning/1.0b2 Thunderbird/3.1.13 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Existing column(s) not readable X-Enigmail-Version: 1.1.1 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Hi, we are running a 3 node cassandra (0.7.6-2) cluster and some of our column families contain quite large rows (400k+ columns, 4-6GB row size). Replicaton factor is 3 for all keyspaces. The cluster is running fine for several months now and we never experienced any serious trouble. Some days ago we noticed, that some previously written columns could not be read. This does not always happen, and only some dozen columns out of 400k are affected. After ruling out application logic as a cause I dumped the row in question with sstable2json and the columns are there (and are not marked for deletion). Next thing was setting up a fresh single node cluster and copying the column family data to that node. Columns could not be read either. Right now I'm running a nodetool compact for the cf to see if data could be read afterwards. Is there any explanation for such behavior? Are there any suggestions for further investigation? TIA, Thomas