hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Namit Jain (JIRA)" <j...@apache.org>
Subject [jira] Created: (HIVE-1702) optimize JDBM to make mapjoin faster
Date Tue, 12 Oct 2010 20:56:33 GMT
optimize JDBM to make mapjoin faster
------------------------------------

                 Key: HIVE-1702
                 URL: https://issues.apache.org/jira/browse/HIVE-1702
             Project: Hadoop Hive
          Issue Type: Improvement
          Components: Query Processor
            Reporter: Namit Jain
            Assignee: Liyin Tang


Htree.get() cost 70% total time. It could help a lot if there is bloom filter here to avoid
unneeded get() if we know for sure the given key is not in JDBM. (we can generate the bloom
filter when doing the jdbm sink, and read into memory when doing read. )

Copied from https://issues.apache.org/jira/browse/HIVE-1700

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message