Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 3614 invoked from network); 4 May 2010 12:52:23 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 4 May 2010 12:52:23 -0000 Received: (qmail 98610 invoked by uid 500); 4 May 2010 12:52:22 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 98588 invoked by uid 500); 4 May 2010 12:52:22 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 98580 invoked by uid 99); 4 May 2010 12:52:22 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 May 2010 12:52:22 +0000 X-ASF-Spam-Status: No, hits=4.7 required=10.0 tests=FREEMAIL_FROM,FREEMAIL_REPLY,HTML_MESSAGE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ddragutoiu@gmail.com designates 209.85.161.44 as permitted sender) Received: from [209.85.161.44] (HELO mail-fx0-f44.google.com) (209.85.161.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 May 2010 12:52:13 +0000 Received: by fxm4 with SMTP id 4so3483301fxm.31 for ; Tue, 04 May 2010 05:51:53 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from :user-agent:mime-version:to:subject:references:in-reply-to :content-type; bh=aJFbsTjUZvT/bfPpUN73HguomxrozBl9PiReLphY/Yo=; b=FmmxlK2dl0N2PZJssw6XgzAbG+5uI2idQUE1zW36ywpzf3g6zhlcaQn8rEk+7WAVm8 KtapcesUnn8G2ZmpNZ/f3E6nj/+ZesLvljaGfFahIrxUwgNLNVfEjxEn9sAiORvJ4YP4 pmKX3AG/pWH229PFtwcflP9BkFL/sp3q9PEVE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type; b=ZKHqnfuBX2USPJGH/67MwjMafzx3yrIyS0ZTcUWMrv580Di+88m96tL9LASvHaRnB2 2COM9KzOx/w/tRIe0/uBMAmP2anmbypmSwFeqk9e0VU25CJFGZ2s2jQQzh31jCZcHqjk a4B10q//nniTXBUVfsls+YN2SO0QW0q9324JQ= Received: by 10.223.16.207 with SMTP id p15mr4178815faa.99.1272977512953; Tue, 04 May 2010 05:51:52 -0700 (PDT) Received: from [192.168.2.100] ([89.137.244.246]) by mx.google.com with ESMTPS id 2sm11280027fks.12.2010.05.04.05.51.51 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 04 May 2010 05:51:52 -0700 (PDT) Message-ID: <4BE01865.1090703@gmail.com> Date: Tue, 04 May 2010 15:51:49 +0300 From: Dorin Dragutoiu User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; ro; rv:1.9.1.9) Gecko/20100317 Lightning/1.0b1 Thunderbird/3.0.4 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: Design Query References: In-Reply-To: Content-Type: multipart/alternative; boundary="------------090904020503000401050305" X-Virus-Checked: Checked by ClamAV on apache.org This is a multi-part message in MIME format. --------------090904020503000401050305 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit 2. I have used the same configuration (3 machines with 4GB RAM) and I got an Out of memory error on compactation each time trying to compact 4 x 128MB sstables. Tried different configuration incl Java Opts, same result. When I have used 16GB ram machine everything worked like a charm. Pe 04.05.2010 12:28, vineet daniel a scris: > > As you havent specified all the details pertaining to filters and your > data layout (structure) at a very high level what i can suggest is > that you need to create a seperate CF for each filter. > > > On Sat, May 1, 2010 at 5:04 PM, Rakesh Rajan > wrote: > > I am evaluating cassandra to implement activity streams. We > currently have over 1000000 feeds with total entries exceeding > 320000000 implemented using redis ( ~320 entries / feed). Would > like hear from the community on how to use cassandra to solve the > following cases: > > 1. Ability to fetch entries by applying a few filters ( like > show me only likes from a given user). This would include > range query to support pagination. So this would mean > indices on a few columns like the feed id, feed type etc. > 2. We have around 3 machines with 4GB RAM for this purpose and > thinking of having replication factor 2. Would 4GB * 3 be > enough for cassandra for this kind of data? I read that > cassandra does not keep all the data in memory but want to > be sure that we have the right server config to handle this > data using cassandra. > > Thanks, > Rakesh > > --------------090904020503000401050305 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: 8bit 2. I have used the same configuration (3 machines with 4GB RAM) and I got an Out of memory error on compactation each time trying to compact 4 x 128MB sstables. Tried different configuration incl Java Opts, same result. When I have used 16GB ram machine everything worked like a charm.

Pe 04.05.2010 12:28, vineet daniel a scris:

As you havent specified all the details pertaining to filters and your data layout (structure) at a very high level what i can suggest is that you need to create a seperate CF for each filter.


On Sat, May 1, 2010 at 5:04 PM, Rakesh Rajan <rakeshxp@gmail.com> wrote:
I am evaluating cassandra to implement activity streams. We currently have over 1000000 feeds with total entries exceeding 320000000 implemented using redis ( ~320 entries / feed). Would like hear from the community on how to use cassandra to solve the following cases:
  1. Ability to fetch entries by applying a few filters ( like show me only likes from a given user). This would include range query to support pagination. So this would mean indices on a few columns like the feed id, feed type etc.
  2. We have around 3 machines with 4GB RAM for this purpose and thinking of having replication factor 2. Would 4GB * 3 be enough for cassandra for this kind of data? I read that cassandra does not keep all the data in memory but want to be sure that we have the right server config to handle this data using cassandra.
Thanks,
Rakesh 

--------------090904020503000401050305--