Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 11871 invoked from network); 11 Feb 2011 16:22:42 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 11 Feb 2011 16:22:42 -0000 Received: (qmail 98258 invoked by uid 500); 11 Feb 2011 16:22:40 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 98034 invoked by uid 500); 11 Feb 2011 16:22:37 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 97824 invoked by uid 99); 11 Feb 2011 16:22:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Feb 2011 16:22:36 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of chris.burroughs@gmail.com designates 209.85.216.44 as permitted sender) Received: from [209.85.216.44] (HELO mail-qw0-f44.google.com) (209.85.216.44) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Feb 2011 16:22:27 +0000 Received: by qwi2 with SMTP id 2so1779748qwi.31 for ; Fri, 11 Feb 2011 08:22:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:message-id:date:from:user-agent:mime-version:to :subject:references:in-reply-to:content-type :content-transfer-encoding; bh=jLd1OtO5jOtgo5bUHkhEAyectmxtTq0z2A07jACBu0Q=; b=tGbNsgGVd0e4HN6//xW1ThcFRT3dBbdxpK8rKsCmrPnuGeW6jN7h1+MRRsmnvdyA8J L4ikAbo52qUZMGGJUfpM1DG3LZvg7KYa0QqT3CcARdQYeNojj+nc1Y/4BPxSH+w8rnDJ Rdi9/FY+BIU7y/SGmsbpan/1eDUs43r84lQeU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; b=bHbbuux603EtlaMCuvnKe0kNUc0WQeJF9UcnjFIeLUv/Jra/VEJ9pPFJfrxkcWZT4M g9S7bOXK9TNp4Oyeo6zhSDZjAK93JwYFoiGH9gPLGf5SvMuomuzFoPsD+6f439jt6JzI lwMuxndHODLIN0bwUYnbaj3uCbsBfMoiSNXF8= Received: by 10.224.46.1 with SMTP id h1mr761874qaf.306.1297441324542; Fri, 11 Feb 2011 08:22:04 -0800 (PST) Received: from [10.10.50.239] (cl-pat-tr.clearspring.com [8.18.54.254]) by mx.google.com with ESMTPS id nb15sm656769qcb.38.2011.02.11.08.22.02 (version=SSLv3 cipher=OTHER); Fri, 11 Feb 2011 08:22:03 -0800 (PST) Message-ID: <4D556229.8040602@gmail.com> Date: Fri, 11 Feb 2011 11:22:01 -0500 From: Chris Burroughs User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101208 Thunderbird/3.1.7 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: Column name size References: In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org On 02/11/2011 05:06 AM, Patrik Modesto wrote: > Hi all! > > I'm thinking if size of a column name could matter for a large dataset > in Cassandra (I mean lots of rows). For example what if I have a row > with 10 columns each has 10 bytes value and 10 bytes name. Do I have > half the row size just of the column names and the other half of the > data (not counting storage overhead)? What if I have 10M of these > rows? Is there a difference? Should I use some 3bytes codes for a > column name to save memory/bandwidth? > > Thanks, > Patrik You are correct that you can for small row/column key values they key itself can represent a large proportion of the total size. I think you will find the consensus on this list is that trying to be clever with names is usually not worth the additional complexity. The right solution to this is https://issues.apache.org/jira/browse/CASSANDRA-47.