hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Update of "Hbase" by udanax
Date Mon, 05 Feb 2007 09:53:37 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/lucene-hadoop/Hbase

The comment on the change is:
Data store and Retrieval Plan

------------------------------------------------------------------------------
  Bigtable (and Hbase) provide a means for organizing and efficiently
  accessing these large data sets.
  
+ == It is not Row-Oriented. ==
+ 
+ It's need to be much smaller, much faster, managed for high-demand analytics and can be
sparse.
+ So, BigTable(Hbase) must Column Oriented storing like C-Store for wide and sparse data.
+ In a column oriented NULLs are much easier to handle, and impose a significantly smaller
performance overhead.
+ And supports both Horizontal/Vertical Parallel Processing.
+ 
+ Do you know RDF(Resource Description Framework) Storage?
+ We Can put it.
+ 
+  * Storing and managing very large amounts of structured data
+  * Row/column space can be sparse
+  * Columns are in the form of (family: optional qualifier). This is a RDF Properties 
+  * Columns have type information  
+  * Because of the design of the system, columns are easy to create (and are created implicitly)

+  * Column families can be split into locality groups (Ontologies!) 
+ 
+ And then, assume some job.
+ I wanna get clustered document set by one of RDF Properties.
+ It can be Readed only vertical(Column) Data from Table, because Column-stored.
+ 
+ 
  == Project Links ==
  
  [wiki:Hbase/HbaseArchitecture  Hbase Architecture - a work in progress]
@@ -45, +67 @@

    * JimKellerman [[MailTo(jim AT SPAMFREE powerset DOT com)]]
    * Doug Judd [[MailTo(doug AT SPAMFREE zvents DOT com)]]
    * Ivan Small [[MailTo(ivan AT SPAMFREE blueseaturtle DOT com)]]
+   * Udanax [[MailTo(webmaster AT SPAMFREE udanax DOT org)]]
+ ----
+ CategoryTemplate CategoryTemplate CategoryTemplate
  

Mime
View raw message