From: canucks
To: hbase-user@hadoop.apache.org
Date: Wed, 20 Jan 2010 19:11:41 -0800 (PST)
Subject: learning hbase - schema design advice

Hi,

I'm pretty interested in learning HBase. What I want to do is store financial data for analytical, graphing, and display purposes.
There are hundreds of millions of rows and, of course, I want fast responses when retrieving the data.

If I were to do it in an RDBMS, the table would be

    REPORT, MARKET, OPERATING_DATE, OPERATING_INTERVAL, HOUR_ENDING, VALUE

where the bolded column names are the PK. If I were to store this in HBase, would it look like this?

    REPORT.MARKET.OPERATING_DATE.OPERATING_INTERVAL.HOUR_ENDING.TIMESTAMP {
        VALUE: 92.29
    }

so that I can do queries like the ones below:

- give me all reports with the name "ABC"
- give me all the values where OPERATING_DATE is from jan-01-2010 to jan-10-2010
- give me all the values where OPERATING_DATE is from jan-01-2010 to jan-10-2010 and HOUR_ENDING is between 5 and 10 (or simply 5, or variations thereof)

In short, is HBase the wrong way to go about it, or would it yield better performance? Also, do you folks happen to know any good links/articles on HBase table and schema design?

Thanks
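
To make the idea concrete, here is a minimal Python sketch of how a composite row key like the one above would sort and how the three queries map onto key ranges. It does not use HBase or any HBase client API; it only models a store that keeps rows sorted by key, which is the property such queries would rely on. Everything named here is an assumption for illustration: the helpers `make_row_key`, `scan`, and `prefix_stop`, the `|` separator, the zero-padded integer formatting of OPERATING_INTERVAL and HOUR_ENDING, the market name "EAST", and all sample values except 92.29. The TIMESTAMP part of the key is left out to keep the sketch small.

```python
from bisect import bisect_left

SEP = "|"  # assumed separator; must not occur inside any field value


def make_row_key(report, market, operating_date, operating_interval, hour_ending):
    """Build a composite key whose string order matches the logical order.

    operating_date is an ISO 'YYYY-MM-DD' string, and the two integers are
    zero-padded so that string comparison agrees with numeric comparison.
    """
    return SEP.join([
        report,
        market,
        operating_date,
        f"{operating_interval:02d}",
        f"{hour_ending:02d}",
    ])


def scan(rows, start_key, stop_key):
    """Return rows with start_key <= key < stop_key from a key-sorted list."""
    keys = [key for key, _ in rows]
    return rows[bisect_left(keys, start_key):bisect_left(keys, stop_key)]


def prefix_stop(prefix):
    """Return a stop key that sorts after every key starting with prefix."""
    return prefix[:-1] + chr(ord(prefix[-1]) + 1)


# Hypothetical sample data, kept sorted by row key.
rows = sorted([
    (make_row_key("ABC", "EAST", "2010-01-01", 1, 5), 92.29),
    (make_row_key("ABC", "EAST", "2010-01-05", 1, 7), 88.10),
    (make_row_key("ABC", "EAST", "2010-01-05", 1, 12), 75.00),
    (make_row_key("ABC", "EAST", "2010-01-11", 1, 6), 60.50),
    (make_row_key("XYZ", "EAST", "2010-01-03", 1, 6), 51.25),
])

# Query 1: everything for report "ABC" is one contiguous key range.
abc_rows = scan(rows, "ABC" + SEP, prefix_stop("ABC" + SEP))

# Query 2: jan-01-2010 to jan-10-2010 inclusive, for a known report and market.
# The stop key is the day after the last wanted day, because stop is exclusive.
date_rows = scan(
    rows,
    SEP.join(["ABC", "EAST", "2010-01-01"]),
    SEP.join(["ABC", "EAST", "2010-01-11"]),
)

# Query 3: HOUR_ENDING sits after the date in the key, so within a date range
# it is not contiguous; here it is filtered row by row after the range scan.
hour_rows = [
    (key, value)
    for key, value in date_rows
    if 5 <= int(key.split(SEP)[4]) <= 10
]

for key, value in hour_rows:
    print(key, value)
```

In this sketch the date range is only a single contiguous scan because REPORT and MARKET are fixed first. A date range across all reports would not be contiguous with this key order, so which field leads the key probably depends on which query matters most.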