Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@minotaur.apache.org Received: (qmail 25139 invoked from network); 16 Feb 2010 00:45:46 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Feb 2010 00:45:46 -0000 Received: (qmail 35261 invoked by uid 500); 16 Feb 2010 00:45:45 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 35177 invoked by uid 500); 16 Feb 2010 00:45:44 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 35166 invoked by uid 99); 16 Feb 2010 00:45:44 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 00:45:44 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [206.190.38.57] (HELO web50303.mail.re2.yahoo.com) (206.190.38.57) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 16 Feb 2010 00:45:34 +0000 Received: (qmail 5839 invoked by uid 60001); 16 Feb 2010 00:45:11 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1266281111; bh=thO/2hgBrz+qxHzkad4pRHLRDL/oVxHEbUB1NRt0Qcg=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:MIME-Version:Content-Type; b=orq5OPAghUoETqJAEFPwFxthMt53WlrCRB40wl0uyMXtsEZc8gOQZJJVC7L+SFPw6cEARzUI8J6AUeA6DtL6sj2te78KNJOoKEohPn6TnGBvT4mno5uOVUb0AUxv7G/m5cs9VQl2xje0whQdY/eCl5SptazcWolnl6aEdlHmC1k= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:MIME-Version:Content-Type; b=B3bJLOWGU6d1jPUuUsEqocaQe0FNKlEPeXsvfD7fz7vNENjK6j/lwzt3TLtrdO0VhM7ILF4y3wE1dByvCaF8H13pY8pjmpRdU0rmITQXAqsWHgxFJd3oK4JmsGHG2KvtJHeEEGwwrQZepLQgpZATjo0KkxZm3yK+B8iPzYn/blw=; Message-ID: <878809.4167.qm@web50303.mail.re2.yahoo.com> X-YMail-OSG: 76GTf4UVM1n6la.rVEAku5c4pAF9XFipwboNdWQKjiirPB2PPCp6whqaP5ZIhN024OcHdQ5IXCKxg2iJb4dSjy9pgXYmAOEYvi_NSW2K.6SStygtMBRwrShwg8n_98jZ4Licgwg0d1bY6E3b0fpbg8xy6HYXrx0c6EVe.eflEmo2bDJ5yq3bOodB7VIwZQKtJ6n6vBnfoOaRIPnEKDWVZR4P1lMQ77HG335xz1YpurF7lYm5PLCihTooI7EfrRck35Y2YqzGPyitL4zTIYOfBtt9nY11J0LXSNYuFXepA27TMJ_T5Jj2Cx0rQwvx4wN43e_5ZRpXAr0fH5UoPFXY7VE2toaDYBw4uyAMPoM4khHfqSnh_NOmPZIo4Q-- Received: from [74.73.1.126] by web50303.mail.re2.yahoo.com via HTTP; Mon, 15 Feb 2010 16:45:11 PST X-Mailer: YahooMailRC/300.3 YahooMailWebService/0.8.100.260964 Date: Mon, 15 Feb 2010 16:45:11 -0800 (PST) From: Otis Gospodnetic Subject: Storing lots of raw log data in HBase To: hbase-user@hadoop.apache.org MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Hello, I've seen the following in a few HBase presentations now: * What to store in HBase? * Maybe not your raw log data... * ...but the results of processing it with Hadoop e.g. slides 26 & 27: http://www.slideshare.net/cloudera/hw09-practical-h-base-getting-the-most-from-your-h-base-install Is there anything wrong in storing raw log data directly into HBase and doing so in real-time, even when that means having to insert a few hundred rows/second? Is the above advice purely because of data volume associated with storing lots of raw logs or some other reason? Thanks, Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/