From: Apache Wiki
To: hadoop-commits@lucene.apache.org
Reply-To: hadoop-dev@lucene.apache.org
Date: Mon, 30 Apr 2007 06:56:49 -0000
Message-ID: <20070430065649.10448.64025@eos.apache.org>
Subject: [Lucene-hadoop Wiki] Trivial Update of "Hbase/HbaseArchitecture" by JimKellerman

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by JimKellerman:
http://wiki.apache.org/lucene-hadoop/Hbase/HbaseArchitecture

------------------------------------------------------------------------------
  Other related features and TODOs:
   1. Single-machine log reconstruction works great, but distributed log recovery is not yet implemented. This is relatively easy, involving just a sort of the log entries, placing the shards into the right DFS directories
   1. Data compression is not yet implemented, but there is an obvious place to do so in the HStore.
-  1. We need easy interfaces to !MapReduce jobs, so they can scan tables. We have been contacted by several parties interested in contributing to HBase, and one has signed up to work on the map/reduce interface.
+  1. We need easy interfaces to !MapReduce jobs, so they can scan tables. We have been contacted by Vuk Ercegovac [[mailto(vercego AT SPAMFREE us DOT ibm DOT com)]] of IBM Almaden Research who expressed an interest in working on an HBase interface to the Hadoop map/reduce interface to HBase.
   1. The HMemcache lookup structure is relatively inefficient
   1. File compaction is relatively slow; we should have a more conservative algorithm for deciding when to apply compaction.
   1. For the getFull() operation, use of Bloom filters would speed things up
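
The first TODO item describes distributed log recovery as a sort of the log entries followed by placing the resulting shards into the right DFS directories. A minimal sketch of that idea follows, written in Python for brevity rather than as HBase's Java code. The `LogEntry` fields, the `split_log` function, and the `/hbase/<region>/recovered.log` path layout are all hypothetical names chosen for illustration; the wiki page does not specify them.

```python
from collections import defaultdict
from typing import NamedTuple


class LogEntry(NamedTuple):
    # All fields are assumptions for illustration, not HBase's log format.
    region: str   # region the edit belongs to
    seq_id: int   # increasing sequence number assigned when the edit was logged
    row: str
    value: bytes


def split_log(entries, root="/hbase"):
    """Group edits by region, then order each group by sequence number.

    Returns a dict mapping a (hypothetical) per-region path to the sorted
    edits that would be written there.
    """
    shards = defaultdict(list)
    for entry in entries:
        shards[entry.region].append(entry)
    return {
        f"{root}/{region}/recovered.log": sorted(edits, key=lambda e: e.seq_id)
        for region, edits in shards.items()
    }
```

Each region's shard can then be replayed independently by whichever server picks up that region, which is presumably why the page calls the task relatively easy.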