Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 68007 invoked from network); 6 Mar 2011 23:40:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Mar 2011 23:40:56 -0000 Received: (qmail 39785 invoked by uid 500); 6 Mar 2011 23:40:54 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 39754 invoked by uid 500); 6 Mar 2011 23:40:54 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 39745 invoked by uid 99); 6 Mar 2011 23:40:54 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Mar 2011 23:40:54 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a40.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Mar 2011 23:40:47 +0000 Received: from homiemail-a40.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a40.g.dreamhost.com (Postfix) with ESMTP id 741CC74C070 for ; Sun, 6 Mar 2011 15:40:22 -0800 (PST) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=subject :references:from:content-type:in-reply-to:message-id:date:to :content-transfer-encoding:mime-version; q=dns; s= thelastpickle.com; b=Kmaa1jd4pZmPgPPmwoB3uuPIypDKZfegY4G+nnPuuXX HWvZXo+gQ9EmZYpMHzcbhUIsm+mGYgPIzK3PR4YFoy1Gam8l81INbAaG/KN7H5e1 B6wj3CPLG+hxQCacPO8irlzdDHh722XxaPI4RYkOdGrXO4I/RoL0MqOnzo/KGZT0 = DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h= subject:references:from:content-type:in-reply-to:message-id:date :to:content-transfer-encoding:mime-version; s=thelastpickle.com; bh=ce2XZlDdaFpoCFyXHq6EXyrUYOo=; b=eCda1deQRga0tcdmnL9APC1js9ly PNr5tSAr0peCyqbqndxAztF1NlqjRcnv8GAZTem3Pyaq3Q5XABpATdwoElsHOpaa 9RhdtIPPpbYfSApSKH160oPrtYtT1v9km0d/yQhEQCClXwEVxsJdsrdpNyiHdlyT 2qGOsG2y/0vP00k= Received: from [115.189.203.72] (unknown [115.189.203.72]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a40.g.dreamhost.com (Postfix) with ESMTPSA id 8C43274C05B for ; Sun, 6 Mar 2011 15:40:21 -0800 (PST) Subject: Re: What would be a good strategy for Storing the large text contents like blog posts in Cassandra. References: From: Aaron Morton Content-Type: text/plain; charset=us-ascii X-Mailer: iPad Mail (8C148) In-Reply-To: Message-Id: Date: Mon, 7 Mar 2011 12:40:12 +1300 To: "user@cassandra.apache.org" Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (iPad Mail 8C148) Sounds reasonable, one CF for the blog post one CF for the comments. You cou= ld also use a single CF if you will often read the blog and the comments at t= he same time. The best design is the one that suits how your app works, try o= ne and be prepared to change. Note that counters are only in the 0.8 trunk and are still under development= , they are not going to be released for a couple of months. Your per column data size is nothing to be concerned abut. Hope that helps. Aaron=20 On 7/03/2011, at 6:35 AM, Aditya Narayan wrote: > What would be a good strategy to store large text content/(blog posts > of around 1500-3000 characters) in cassandra? I need to store these > blog posts along with their metadata like bloggerId, blogTags. I am > looking forward to store this data in a single row giving each > attribute a single column. So one blog per row. Is using a single > column for a large blog post like this a good strategy? >=20 > Next, I also need to store the blogComments which I am planning to > store all, in another single row. 1 comment per column. Thus the > entire information about the a single comment like commentBody, > commentor would be serialized(using google Protocol buffers) and > stored in a single column, > For storing the no. of likes of each comment itself, I am planning to > keep a counter_column, in the same row, for each comment that will > hold an no. specifiying no. of 'likes' of that comment. >=20 > Any suggestions on the above design highly appreciated.. Thanks.