Message-ID: <4F1A3304.4000409@bnl.gov>
Date: Fri, 20 Jan 2012 19:37:40 -0800
From: Maxim Potekhin
To: user@cassandra.apache.org
Subject: Re: Cassandra to Oracle?

What makes you think that an RDBMS will give you acceptable performance? I guess you will try to index it to death (because otherwise the "ad hoc" queries won't work well, if at all), and at that point you may be hit with a performance penalty.

It may be a good idea to interview users and build denormalized views in Cassandra, maybe on a separate "look-up" cluster. A few percent of users will be unhappy, but you'll find it hard to do better.

I'm speaking from my experience with an industrial-strength RDBMS, which doesn't scale very well for what you call "ad-hoc" queries.

Regards,
Maxim

On 1/20/2012 9:28 AM, Brian O'Neill wrote:
>
> I can't remember if I asked this question before, but....
>
> We're using Cassandra as our transactional system, and building up
> quite a library of map/reduce jobs that perform data quality analysis,
> statistics, etc.
> (> 100 jobs now)
>
> But... we are still struggling to provide an "ad-hoc" query mechanism
> for our users.
>
> To fill that gap, I believe we still need to materialize our data in
> an RDBMS.
>
> Anyone have any ideas? Better ways to support ad-hoc queries?
>
> Effectively, our users want to be able to select count(distinct Y)
> from X group by Z.
> Where Y and Z are arbitrary columns of rows in X.
>
> We believe we can create column families with different key structures
> (using Y and Z as row keys), but some column names we don't know /
> can't predict ahead of time.
>
> Are people doing bulk exports?
> Anyone trying to keep an RDBMS in sync in real-time?
>
> -brian
>
> --
> Brian ONeill
> Lead Architect, Health Market Science (http://healthmarketscience.com)
> mobile:215.588.6024
> blog: http://weblogs.java.net/blog/boneill42/
> blog: http://brianoneill.blogspot.com/
>
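
For reference, the aggregation discussed in this thread (select count(distinct Y) from X group by Z, with Y and Z chosen at query time) can be sketched in plain Python as below. This is only an illustration under stated assumptions: rows are modeled as dicts whose columns are not known ahead of time, and the function name count_distinct_by and the sample data are hypothetical, not taken from the thread or from any Cassandra API.

```python
from collections import defaultdict


def count_distinct_by(rows, y, z):
    """Return {z_value: number of distinct y values}.

    Rows lacking either column are skipped, since column names are
    not guaranteed to be present on every row.
    """
    seen = defaultdict(set)  # z value -> set of y values seen so far
    for row in rows:
        if y in row and z in row:
            seen[row[z]].add(row[y])
    return {key: len(values) for key, values in seen.items()}


# Hypothetical sample rows with sparse, arbitrary columns.
rows = [
    {"state": "PA", "specialty": "cardiology", "name": "a"},
    {"state": "PA", "specialty": "oncology"},
    {"state": "PA", "specialty": "cardiology"},
    {"state": "NJ", "specialty": "oncology"},
    {"state": "NJ"},
]

print(count_distinct_by(rows, y="specialty", z="state"))
# prints {'PA': 2, 'NJ': 1}
```

The same grouping step is roughly what a map/reduce job or a denormalized view keyed on Z would have to maintain; the sketch ignores distribution and memory limits entirely.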