From: Ian Holsman
Date: Tue, 15 Jun 2010 18:22:02 -0400
To: common-user@hadoop.apache.org, user@cassandra.apache.org
Subject: [OT] Real Time Open source solutions for aggregation and stream processing

Firstly, my apologies for the off-topic message, but I thought most people on this list would be knowledgeable about and interested in this kind of thing.

We are looking for an open source, scalable solution for real-time aggregation and stream processing (similar to what the 'hop' project, http://code.google.com/p/hop/, set out to do) for large(ish) click-stream logs.

My first thought was something like Esper, but in our testing it hits the wall at around 10,000 rules per JVM.

I was wondering whether any of you have experience in this area, and what your favorite toolsets are.

Currently we are using Cassandra and Redis with home-grown software to do the aggregation, but I'd love to use a common package if there is one.

Again, apologies for the off-topic message and the cross-posting.

Regards,
Ian