Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 711369DE2 for ; Mon, 20 Feb 2012 18:45:46 +0000 (UTC) Received: (qmail 61822 invoked by uid 500); 20 Feb 2012 18:45:44 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 61791 invoked by uid 500); 20 Feb 2012 18:45:44 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 61783 invoked by uid 99); 20 Feb 2012 18:45:44 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 Feb 2012 18:45:44 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of bstarkenburg@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-ww0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 Feb 2012 18:45:36 +0000 Received: by wgbdt10 with SMTP id dt10so3911638wgb.25 for ; Mon, 20 Feb 2012 10:45:16 -0800 (PST) Received-SPF: pass (google.com: domain of bstarkenburg@gmail.com designates 10.180.104.4 as permitted sender) client-ip=10.180.104.4; Authentication-Results: mr.google.com; spf=pass (google.com: domain of bstarkenburg@gmail.com designates 10.180.104.4 as permitted sender) smtp.mail=bstarkenburg@gmail.com; dkim=pass header.i=bstarkenburg@gmail.com Received: from mr.google.com ([10.180.104.4]) by 10.180.104.4 with SMTP id ga4mr20622207wib.17.1329763516521 (num_hops = 1); Mon, 20 Feb 2012 10:45:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; bh=6H7ozA18HSe5ZsPWlvDM3YShSlCHXPJlRVyZcFXHeSM=; b=qhZumYBnwVntQawl37fdCs34nI0oGhTmBQhA6cmmyT2YvQ/OpJ/FAU4WlFgfmubekr f6ZfZjttWJI8GJWPOXVXz7psrU8BnuUSh3lbebTTnb2B2H9bduVJ2BBoW9lsU5GEStNq FN85im6GpflMyzGW5MaZh+RdmZBu/ASwEw8nQ= MIME-Version: 1.0 Received: by 10.180.104.4 with SMTP id ga4mr17254770wib.17.1329763516488; Mon, 20 Feb 2012 10:45:16 -0800 (PST) Received: by 10.223.91.65 with HTTP; Mon, 20 Feb 2012 10:45:16 -0800 (PST) Date: Mon, 20 Feb 2012 10:45:16 -0800 Message-ID: Subject: Wide Row Performance & Index Question From: Blake Starkenburg To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=f46d04427182091b6004b969b2a4 X-Virus-Checked: Checked by ClamAV on apache.org --f46d04427182091b6004b969b2a4 Content-Type: text/plain; charset=ISO-8859-1 Question pertaining to wide or large rows in Cassandra. I recall reading in a blog I believe posted by Aaron Morton a notation that Cassandra creates its own index of a row when it reaches X amount of columns? My curiosity is: A.) At what column count does this happen? B.) If Thrift is only getting slices of a large row (column_start=X, column_end=Y, limit 20) is their any performance hits to rows over and above the A.) threshold above? Finally, I am correct in thinking the cluster may appear slightly unbalanced depending on the RF and the amount of nodes with a great deal of large rows? note: using php_cassa & Cassandra 0.8.10 Thanks! --f46d04427182091b6004b969b2a4 Content-Type: text/html; charset=ISO-8859-1 Question pertaining to wide or large rows in Cassandra. I recall reading in a blog I believe posted by Aaron Morton a notation that Cassandra creates its own index of a row when it reaches X amount of columns? My curiosity is:

A.) At what column count does this happen?
B.) If Thrift is only getting slices of a large row (column_start=X, column_end=Y, limit 20) is their any performance hits to rows over and above the A.) threshold above?

Finally, I am correct in thinking the cluster may appear slightly unbalanced depending on the RF and the amount of nodes with a great deal of large rows?

note: using php_cassa & Cassandra 0.8.10

Thanks!
--f46d04427182091b6004b969b2a4--