Date: Tue, 29 Sep 2009 20:46:42 -0700
Message-ID: <7c962aed0909292046n6231b9a5xd5aded9155f0903@mail.gmail.com>
Subject: Re: use hbase to store realtime logs
From: stack
To: hbase-user@hadoop.apache.org

You could use HBase to do this, but why not just put the logs into HDFS? (Check out tech like Facebook's Scribe.) If you do put them into HBase, make sure you provision your cluster with sufficient firepower: measure the write rate to a single node, then size appropriately, giving yourself a decent amount of elbow room to grow in.

A Unix timestamp is not enough to uniquely identify log entries, not if you are doing 100k a second. You may have to design a better key than this. Add a sequence number or some such.

St.Ack

Other architectures that you might consider are writing files locally and then periodically pushing them to HDFS.

On Tue, Sep 29, 2009 at 6:17 PM, Zheng Shao wrote:

> Is it a good use case to store realtime logs into hbase?
>
> I am thinking of using unix timestamp as the key, and we have 100K rows per
> second, and 100 bytes per row (about 10MB/second).
> Users can do range query to get the latest rows. Periodically, we rotate
> the tables.
>
> In my case, the key is monotonically increasing but HBase is general enough
> to take random keys.
> I am not sure this is a good use case for HBase.
>
> Does anybody have a similar use case? Does HBase work well for this?
>
> Zheng
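
The "timestamp plus sequence number" key suggested in the reply could look roughly like the sketch below. It is only an illustration: the class name RowKeyGenerator, the 8-byte timestamp plus 4-byte sequence layout, and the single-writer assumption are all hypothetical and not something the thread specifies. With several writers you would probably also need a writer or host id in the key.

```python
import struct
import threading
import time


class RowKeyGenerator:
    """Hypothetical helper: builds 12-byte keys of <timestamp><sequence>.

    Assumes a single writer process; keys from different processes could
    still collide unless a writer id is added to the layout.
    """

    def __init__(self):
        self._lock = threading.Lock()
        self._last_second = -1
        self._sequence = 0

    def next_key(self, now=None):
        # Unix timestamp in whole seconds; `now` can be injected for testing.
        second = int(time.time()) if now is None else int(now)
        with self._lock:
            if second > self._last_second:
                # New second: restart the per-second counter.
                self._last_second = second
                self._sequence = 0
            else:
                # Same second (or the clock stepped back): keep the last
                # timestamp and bump the counter so keys stay unique.
                self._sequence += 1
            # Big-endian unsigned fields, so byte-wise (lexicographic)
            # ordering of keys matches chronological ordering.
            return struct.pack(">QI", self._last_second, self._sequence)


if __name__ == "__main__":
    gen = RowKeyGenerator()
    for _ in range(3):
        print(gen.next_key().hex())
```

Because the fields are fixed-width and big-endian, a range query for a time window can use the packed start and end timestamps (with sequence 0) as the scan boundaries. The 4-byte counter leaves plenty of room for 100K rows per second.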