hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Trivial Update of "Hbase/NewFileFormat" by stack
Date Thu, 22 Jan 2009 05:01:15 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by stack:
http://wiki.apache.org/hadoop/Hbase/NewFileFormat

------------------------------------------------------------------------------
   * Always-on General bloomfilter. We know how many entries a file will have when we go to
flush it so we can optimally size a bloomfilter.  The small amount of memory a bloomfilter
occupies will pay for itself many-fold in the seeks saved trying to figure is a file contains
an asked for key.
   * Optimal random-access
   * Iterate over keys only, rather than mapfiles currenty key+values always.  This'd be useful
when trying to find closest. TFile and SequenceFile can do this (Its not exposed in !MapFile).
+  * Smart getClosest and getClosestAtOrBefore [https://issues.apache.org/jira/browse/HBASE-792
hbase-792]
  
  === Index ===
  TODO, but the TFile block-based rather than !MapFile interval-based would seem better for
us; indices then are of predicatable size; a seek to the index position will load at an amenable
spot when blocks are compressed. 
  
  === Nice-to-haves ===
-  * Don't write out the family portion of column when writing keys.
+  * Don't write out the family portion of column when writing keys [https://issues.apache.org/jira/browse/HBASE-68
HBASE-68]
  
  === Excercise ===
  

Mime
View raw message