From: Sujee Maniyam
Date: Sun, 14 Feb 2010 00:56:53 -0800
Subject: how to calculate top-xxx rowkeys
To: hbase-user

Hi,

I have a table whose rowkey is composed of userid + timestamp. I need to figure out the 'top-100' users. One approach is to run a scanner and keep a hashmap of per-user counts in memory. I'm wondering if there is an HBase trick I could use?

Thanks,
Sujee
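
For reference, the scanner-plus-hashmap approach described above might look roughly like the sketch below. It is only an illustration, not HBase client code: the scan is stood in for by any iterable of rowkey strings, and the '<userid>_<timestamp>' layout with an underscore separator is an assumption, since the message does not say how the two parts are joined. The names user_of and top_users are hypothetical.

```python
import heapq
from collections import Counter
from typing import Iterable, List, Tuple


def user_of(rowkey: str, sep: str = "_") -> str:
    """Return the userid part of a rowkey.

    Assumes a '<userid><sep><timestamp>' layout (hypothetical); the split is
    taken at the last separator so that userids may contain the separator.
    """
    return rowkey.rsplit(sep, 1)[0]


def top_users(rowkeys: Iterable[str], n: int = 100) -> List[Tuple[str, int]]:
    """Count rows per user in memory and return the n users with the most rows."""
    # One pass over the scanned rowkeys, keeping a userid -> row count map.
    counts = Counter(user_of(key) for key in rowkeys)
    # Highest count first; ties are broken by userid so the order is stable.
    return heapq.nsmallest(n, counts.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    # Stand-in for the rowkeys a table scan would return.
    rows = [
        "alice_1266137001",
        "alice_1266137002",
        "bob_1266137003",
        "alice_1266137004",
        "carol_1266137005",
        "bob_1266137006",
    ]
    for user, count in top_users(rows, n=2):
        print(user, count)
```

Memory use here grows with the number of distinct users, not the number of rows, which is the main cost of keeping the counts in a single in-memory map.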