Return-Path: Delivered-To: apmail-hadoop-common-commits-archive@www.apache.org Received: (qmail 15093 invoked from network); 2 Apr 2011 18:55:39 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 2 Apr 2011 18:55:39 -0000 Received: (qmail 41423 invoked by uid 500); 2 Apr 2011 18:55:39 -0000 Delivered-To: apmail-hadoop-common-commits-archive@hadoop.apache.org Received: (qmail 41363 invoked by uid 500); 2 Apr 2011 18:55:39 -0000 Mailing-List: contact common-commits-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-commits@hadoop.apache.org Received: (qmail 41356 invoked by uid 500); 2 Apr 2011 18:55:39 -0000 Delivered-To: apmail-hadoop-core-commits@hadoop.apache.org Received: (qmail 41352 invoked by uid 99); 2 Apr 2011 18:55:39 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 02 Apr 2011 18:55:39 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.131] (HELO eos.apache.org) (140.211.11.131) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 02 Apr 2011 18:55:37 +0000 Received: from eos.apache.org (localhost [127.0.0.1]) by eos.apache.org (Postfix) with ESMTP id 444E4397; Sat, 2 Apr 2011 18:55:17 +0000 (UTC) MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable From: Apache Wiki To: Apache Wiki Date: Sat, 02 Apr 2011 18:55:17 -0000 Message-ID: <20110402185517.57275.64020@eos.apache.org> Subject: =?utf-8?q?=5BHadoop_Wiki=5D_Update_of_=22Hbase/DesignOverview=22_by_DougM?= =?utf-8?q?eil?= Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for ch= ange notification. The "Hbase/DesignOverview" page has been changed by DougMeil. The comment on this change is: Per stack, this page is now directing reader= s to the HBase book.. http://wiki.apache.org/hadoop/Hbase/DesignOverview?action=3Ddiff&rev1=3D21&= rev2=3D22 -------------------------------------------------- - '''This page was created on 06.03.09 and now is in progress of constructi= on....''' - =3D Table of Contents =3D = + The HBase design overview can now be found in the HBase book at [[http://= hbase.apache.org/book.html#datamodel]] = - * [[#intro|Introduction]] - * [[#datamodel|Data Model]] - * [[#conceptual|Conceptual View]] - * [[#internal|Internal View]] - * [[#api|API]] - * [[#design|Architecture Design]] - * [[#master|HBaseMaster]] - * [[#hregionserv|HRegionServer]] - * [[#client|HBase Client]] - * [[#impl|Implementation]] - <> - =3D Introduction =3D = - This paper is HBase oriented analogue of Google [[http://labs.google.com/= papers/bigtable.html|Bigtable paper]]. This paper will be self-sufficient. = - = - HBase is an [[http://apache.org/|Apache]] open source project whose goal = is to provide Bigtable-like storage for the Hadoop Distributed Computing En= vironment. HBase leverages the distributed data storage provided by the [[h= ttp://hadoop.apache.org/core/docs/current/hdfs_design.html|Hadoop Distribut= ed File System (HDFS)]] and use [[http://hadoop.apache.org/zookeeper/docs/c= urrent/zookeeperOver.html|ZooKeeper]] for coordination between HBase nodes. - = - Data is logically organized into tables, rows and columns. An iterator-li= ke interface is available for scanning through a row range and, of course, = there is the ability to retrieve a column value for a specific row key. Any= particular column may have multiple versions for the same row key. - = - <> - =3D Data Model =3D - = - Applications store data rows in labeled tables. A data row has a sortable= row key and an arbitrary number of columns. The table is stored sparsely, = so that rows in the same table can have widely varying numbers of columns. - = - HBase table is three dimensional sorted map. It maps from Cartesian produ= ct of row key, column key and timestamp to cell value: - = - '''(row:byte[] x column:byte[] x timestamp:Long) -> byte[]''' - = - A column name has the form ''":