Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 19095 invoked from network); 12 Feb 2011 13:52:23 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 12 Feb 2011 13:52:23 -0000 Received: (qmail 51799 invoked by uid 500); 12 Feb 2011 13:52:23 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 51222 invoked by uid 500); 12 Feb 2011 13:52:20 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 51214 invoked by uid 99); 12 Feb 2011 13:52:19 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 12 Feb 2011 13:52:19 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 12 Feb 2011 13:52:17 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id B15081A1BDB for ; Sat, 12 Feb 2011 13:51:57 +0000 (UTC) Date: Sat, 12 Feb 2011 13:51:57 +0000 (UTC) From: "Gary Dusbabek (JIRA)" To: commits@cassandra.apache.org Message-ID: <121469761.12826.1297518717722.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Issue Comment Edited: (CASSANDRA-1472) Add bitmap secondary indexes MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983248#comment-12983248 ] Gary Dusbabek edited comment on CASSANDRA-1472 at 2/12/11 1:50 PM: ------------------------------------------------------------------- bq. would it be reasonable to commit this patch with Avro's file format, and then to discuss replacing the file format in a separate issue Not a good idea, imo. bq. writing our own file format is an optimization I don't think it is an optimization at all. It's a way of protecting us from ourselves. The only way to do that and still use avro is to create our own readers according to the spec, no? bq. This decision needs to be made for technical reasons and not grounded in NIH. The reasons are technical. The fact is that the way we use avro is constantly breaking on us (I'm referring to migrations). Some of this is probably self-inflicted. We don't have much experience yet with what happens when records change slightly or radically over time and how avro copes with that. The experience we do have leads me to believe that unless we are extremely cautious, we'll end up in a hole. See CASSANDRA-2001 for an example. Two seemly innocuous changes have both managed to break migration deserialization. was (Author: gdusbabek): bq. would it be reasonable to commit this patch with Avro's file format, and then to discuss replacing the file format in a separate issue Not a good idea, imo. bq. writing our own file format is an optimization I don't think it is an optimization at all. It's a way of protecting us from ourselves. The only way to do that and still use avro is to create our own readers according to the spec, no? bq. This decision needs to be made for technical reasons and not grounded in NIH. Poppycock. The reasons are technical. The fact is that the way we use avro is constantly breaking on us (I'm referring to migrations). Some of this is probably self-inflicted, but either way--we should avoid chopping down the forest until we can operate the chainsaw without cutting off our fingers. We don't have much experience yet with what happens when records change slightly or radically over time and how avro copes with that. The experience we do have leads me to believe that unless we are extremely cautious, we'll end up in a hole. See CASSANDRA-2001 for an example. Two seemly innocuous changes have both managed to break migration deserialization. > Add bitmap secondary indexes > ---------------------------- > > Key: CASSANDRA-1472 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1472 > Project: Cassandra > Issue Type: Improvement > Components: Core > Reporter: Stu Hood > Assignee: Stu Hood > Fix For: 0.7.2 > > Attachments: 0.7-1472-v5.tgz, 0.7-1472-v6.tgz, 0001-CASSANDRA-1472-rebased-to-0.7-branch.txt, 0019-Rename-bugfixes-and-fileclose.txt, 1472-v3.tgz, 1472-v4.tgz, 1472-v5.tgz, anatomy.png, v4-bench-c32.txt > > > Bitmap indexes are a very efficient structure for dealing with immutable data. We can take advantage of the fact that SSTables are immutable by attaching them directly to SSTables as a new component (supported by CASSANDRA-1471). -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira