Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5034CD974 for ; Thu, 27 Sep 2012 14:01:46 +0000 (UTC) Received: (qmail 10444 invoked by uid 500); 27 Sep 2012 14:01:44 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 10407 invoked by uid 500); 27 Sep 2012 14:01:43 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 10399 invoked by uid 99); 27 Sep 2012 14:01:43 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Sep 2012 14:01:43 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mvallebr@gmail.com designates 209.85.214.44 as permitted sender) Received: from [209.85.214.44] (HELO mail-bk0-f44.google.com) (209.85.214.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Sep 2012 14:01:39 +0000 Received: by bkcjc3 with SMTP id jc3so1877579bkc.31 for ; Thu, 27 Sep 2012 07:01:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=T/ST+ha2jnp3cksVCYczfVyQq8Z2JMAml5jwvv6vy2A=; b=X8DDA2PDSi5DUyKHY3Zs0aCG22Yt63MEnAX+TAe7RyLz+agSjWFCS48U9QCpCN92MW 3pFwXcD36PFA5uzLL+Z7VCfl24OYekRr0THEPu0yVE5SwmWH+OmeLeqWTln7yIVt8gLe tR21mzFwaIzFdfvTRKrMogWkpAOAyVHi/APS/cxqtqBcdcJGTFGbsJ4Sy4k3EuzM84dV /hyeTYlaB01t73qP9c3GlokoZ7UIoPMy3dRwZDOgf2yPlTwFK9S/vRpYPhRpluzDrXVq S30q0Tu1g2XQoyNeN9zNAAD9NbbEDuoc/eOBb+iDImKr9YFYVKLGLWUWc5z4eQnfG3MA dO1w== MIME-Version: 1.0 Received: by 10.152.112.233 with SMTP id it9mr3322243lab.40.1348754468785; Thu, 27 Sep 2012 07:01:08 -0700 (PDT) Received: by 10.112.21.132 with HTTP; Thu, 27 Sep 2012 07:01:08 -0700 (PDT) In-Reply-To: References: Date: Thu, 27 Sep 2012 11:01:08 -0300 Message-ID: Subject: Re: 1000's of column families From: Marcelo Elias Del Valle To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=f46d040838d3004f8204caaf5f6c X-Virus-Checked: Checked by ClamAV on apache.org --f46d040838d3004f8204caaf5f6c Content-Type: text/plain; charset=ISO-8859-1 Out of curiosity, is it really necessary to have that amount of CFs? I am probably still used to relational databases, where you would use a new table just in case you need to store different kinds of data. As Cassandra stores anything in each CF, it might probably make sense to have a lot of CFs to store your data... But why wouldn't you use a single CF with partitions in these case? Wouldn't it be the same thing? I am asking because I might learn a new modeling technique with the answer. []s 2012/9/26 Hiller, Dean > We are streaming data with 1 stream per 1 CF and we have 1000's of CF. > When using the tools they are all geared to analyzing ONE column family at > a time :(. If I remember correctly, Cassandra supports as many CF's as you > want, correct? Even though I am going to have tons of funs with > limitations on the tools, correct? > > (I may end up wrapping the node tool with my own aggregate calls if needed > to sum up multiple column families and such). > > Thanks, > Dean > -- Marcelo Elias Del Valle http://mvalle.com - @mvallebr --f46d040838d3004f8204caaf5f6c Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Out of curiosity, is it really necessary to have that amount of CFs?
I = am probably still used to relational databases, where you would use a new t= able just in case you need to store different kinds of data. As Cassandra s= tores anything in each CF, it might probably make sense to have a lot of CF= s to store your data...
But why wouldn't you use a single CF with partitions in these case= ? Wouldn't it be the same thing? I am asking because I might learn a ne= w modeling technique with the answer.

[]s

2012/9/26 Hiller, Dean <Dean.Hiller@nrel.gov= >
We are streaming data with 1 stream per 1 CF and we have 1000's of CF. = =A0When using the tools they are all geared to analyzing ONE column family = at a time :(. =A0If I remember correctly, Cassandra supports as many CF'= ;s as you want, correct? =A0Even though I am going to have tons of funs wit= h limitations on the tools, correct?

(I may end up wrapping the node tool with my own aggregate calls if needed = to sum up multiple column families and such).

Thanks,
Dean



--
Marcelo Elia= s Del Valle
http://mvall= e.com=A0- @mvallebr
--f46d040838d3004f8204caaf5f6c--