Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BB7FD955E for ; Sat, 19 May 2012 12:05:26 +0000 (UTC) Received: (qmail 28241 invoked by uid 500); 19 May 2012 12:05:24 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 28126 invoked by uid 500); 19 May 2012 12:05:23 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 28108 invoked by uid 99); 19 May 2012 12:05:22 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 19 May 2012 12:05:22 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of gcdcu-cassandra-user-1@m.gmane.org designates 80.91.229.3 as permitted sender) Received: from [80.91.229.3] (HELO plane.gmane.org) (80.91.229.3) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 19 May 2012 12:05:15 +0000 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1SViOt-0001f3-QG for user@cassandra.apache.org; Sat, 19 May 2012 14:04:47 +0200 Received: from c-68-32-133-231.hsd1.nj.comcast.net ([68.32.133.231]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sat, 19 May 2012 14:04:47 +0200 Received: from oleg.dulin by c-68-32-133-231.hsd1.nj.comcast.net with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sat, 19 May 2012 14:04:47 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: user@cassandra.apache.org From: Oleg Dulin Subject: Data aggregation - averages, sums, etc. Date: Sat, 19 May 2012 12:04:35 +0000 (UTC) Lines: 21 Message-ID: <972495380359121597.085627oleg.dulin-gmail.com@news.gmane.org> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: c-68-32-133-231.hsd1.nj.comcast.net User-Agent: NewsTap/3.5.5 (iPad) X-Virus-Checked: Checked by ClamAV on apache.org Dear distinguished colleagues: I am trying to come up with a data model that lets me do aggregations, such as sums and averages. Here are my requirements: 1. Data may be updated concurrently 2. I want to avoid changing schema; we have a multitennant cloud solution that is driven by configuration. Schema is the same for all customs. Here is what I have at my disposal: 1. We have a proprietary distributed in memory column store that acts as a buffer between the server and Cassandra. Frequent reads are not a problem. 2. I know I have counter columns. I can do sums. But can I do averages ? One of the ideas is to record data as it comes in organized by time and periodically aggregate it. Thoughts ?