hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Created: (HBASE-1200) Add bloomfilters to hfile; use dynamicbloomfilter instead of base bloomfilter; depend on hadoop 0.20
Date Fri, 13 Feb 2009 18:38:59 GMT
Add bloomfilters to hfile; use dynamicbloomfilter instead of base bloomfilter; depend on hadoop
0.20
----------------------------------------------------------------------------------------------------

                 Key: HBASE-1200
                 URL: https://issues.apache.org/jira/browse/HBASE-1200
             Project: Hadoop HBase
          Issue Type: Task
            Reporter: stack
            Assignee: stack
             Fix For: 0.20.0


Add bloomfiltering to hfile.  Should it be optional or on always?  Currently, we bloom filter
rows only, not the column + ts component, which seems good place to start but we size the
bloomfilter with the number of entries we are about to flush which seems like usually we'd
be making a filter too big.  How to figure how many rows in the flush?   We should use the
DynamicBloomFilter as Andrezj does up in hadoop BloomFilterMapFile.  Start small and let it
resize as entries are added.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message