Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 5660 invoked from network); 1 May 2010 11:35:21 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 1 May 2010 11:35:21 -0000 Received: (qmail 93476 invoked by uid 500); 1 May 2010 11:35:20 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 93433 invoked by uid 500); 1 May 2010 11:35:19 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 93425 invoked by uid 99); 1 May 2010 11:35:19 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 01 May 2010 11:35:19 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of rakeshxp@gmail.com designates 209.85.221.192 as permitted sender) Received: from [209.85.221.192] (HELO mail-qy0-f192.google.com) (209.85.221.192) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 01 May 2010 11:35:12 +0000 Received: by qyk30 with SMTP id 30so1463699qyk.16 for ; Sat, 01 May 2010 04:34:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:from:date :message-id:subject:to:content-type; bh=MsceUH3t9DU9jVe6gwErMvSzDZUmtwEwR5TfaN5qedc=; b=AVXi8ZMLElIlu/u408yxwtLVNLDRsNehSA0e2RcdLreYm5DGyYJkrc0m2GMR5FQdY0 /4wNPw72Tzz14fw4Pp2slbfZSd+vnnceUVAZ+OGqTBAu/79v9lZIzIu4bCQWpqfBtCY0 rFr3B3R2ONYPdt+RUtY9y6mkiqS83uq8aHumw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type; b=HIf+8y3SYMjpxEG1CSTZDVyh3fu04mb571+rjSSYV9qaaxsXckw8op6PWHkaIkvFfx N35klBL686sk3Yqqlt6KSZiejRPOeEmzvAgwlDrhX//dJ/8qFOhlVL+ztha50Nl4Qs5J KJK28FW+m/3HFoj/cUeJyo37DnCYLaM0lrQgQ= Received: by 10.229.182.145 with SMTP id cc17mr342894qcb.35.1272713691201; Sat, 01 May 2010 04:34:51 -0700 (PDT) MIME-Version: 1.0 Received: by 10.229.1.72 with HTTP; Sat, 1 May 2010 04:34:31 -0700 (PDT) From: Rakesh Rajan Date: Sat, 1 May 2010 17:04:31 +0530 Message-ID: Subject: Design Query To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016364ecbf276fff6048586bf62 X-Virus-Checked: Checked by ClamAV on apache.org --0016364ecbf276fff6048586bf62 Content-Type: text/plain; charset=ISO-8859-1 I am evaluating cassandra to implement activity streams. We currently have over 1000000 feeds with total entries exceeding 320000000 implemented using redis ( ~320 entries / feed). Would like hear from the community on how to use cassandra to solve the following cases: 1. Ability to fetch entries by applying a few filters ( like show me only likes from a given user). This would include range query to support pagination. So this would mean indices on a few columns like the feed id, feed type etc. 2. We have around 3 machines with 4GB RAM for this purpose and thinking of having replication factor 2. Would 4GB * 3 be enough for cassandra for this kind of data? I read that cassandra does not keep all the data in memory but want to be sure that we have the right server config to handle this data using cassandra. Thanks, Rakesh --0016364ecbf276fff6048586bf62 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
I am evaluating cassandra to implement activity streams. We currently = have over 1000000 feeds with total entries exceeding 320000000 implemented = using redis ( ~320 entries / feed). Would like hear from the community on h= ow to use cassandra to solve the following cases:
  1. Ability to fetch entries by applying a few filters ( like show= me only likes from a given user). This would include range query to suppor= t pagination. So this would mean indices on a few columns like the feed id,= feed type etc.
  2. We have around 3 machines with 4GB RAM for this purpose and thinking of= having replication factor 2. Would 4GB * 3 be enough for cassandra for thi= s kind of data? I read that cassandra does not keep all the data in memory = but want to be sure that we have the right server config to handle this dat= a using cassandra.
Thanks,
Rakesh=A0
--0016364ecbf276fff6048586bf62--