hive-dev mailing list archives

From "Namit Jain (JIRA)" <>
Subject [jira] Created: (HIVE-1702) optimize JDBM to make mapjoin faster
Date Tue, 12 Oct 2010 20:56:33 GMT
optimize JDBM to make mapjoin faster

                 Key: HIVE-1702
             Project: Hadoop Hive
          Issue Type: Improvement
          Components: Query Processor
            Reporter: Namit Jain
            Assignee: Liyin Tang

Htree.get() accounts for 70% of the total time. A Bloom filter here would help a lot by
avoiding unneeded get() calls when we know for sure that the given key is not in JDBM. (We can
generate the Bloom filter in the JDBM sink and load it into memory on the read side.)
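
A minimal sketch of the idea, not Hive's or JDBM's actual code: a small Bloom filter is
filled while keys are written and consulted before each lookup, so keys that are definitely
absent never reach the expensive get(). All names here (BloomGuardedLookup, BloomFilter,
storeGets) are hypothetical, a HashMap stands in for the JDBM Htree, and the filter size,
hash count, and hash mixing are illustrative assumptions only.

```java
import java.util.BitSet;
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration: a Bloom filter guarding an expensive key lookup.
class BloomGuardedLookup {

    /** Minimal Bloom filter: k bit positions per key, derived by double hashing. */
    static final class BloomFilter {
        private final BitSet bits;
        private final int numBits;
        private final int numHashes;

        BloomFilter(int numBits, int numHashes) {
            this.bits = new BitSet(numBits);
            this.numBits = numBits;
            this.numHashes = numHashes;
        }

        // i-th bit position for a key: (h1 + i * h2) mod numBits.
        private int index(Object key, int i) {
            int h1 = key.hashCode();
            int h2 = (h1 * 0x9E3779B9) | 1; // simple multiplicative mix, forced odd
            return Math.floorMod(h1 + i * h2, numBits);
        }

        void add(Object key) {
            for (int i = 0; i < numHashes; i++) {
                bits.set(index(key, i));
            }
        }

        /** False means the key was never added; true means it may have been. */
        boolean mightContain(Object key) {
            for (int i = 0; i < numHashes; i++) {
                if (!bits.get(index(key, i))) {
                    return false;
                }
            }
            return true;
        }
    }

    // Stand-in for the on-disk JDBM table (assumption: the real store is far slower).
    private final Map<String, String> store = new HashMap<>();
    private final BloomFilter filter = new BloomFilter(1 << 16, 3);
    int storeGets = 0; // counts lookups that actually reached the store

    // Sink side: write the row and record its key in the filter.
    void put(String key, String value) {
        store.put(key, value);
        filter.add(key);
    }

    // Read side: skip the store entirely when the filter rules the key out.
    String get(String key) {
        if (!filter.mightContain(key)) {
            return null;
        }
        storeGets++;
        return store.get(key);
    }

    public static void main(String[] args) {
        BloomGuardedLookup table = new BloomGuardedLookup();
        table.put("a", "1");
        System.out.println(table.get("a"));  // present key: reaches the store
        System.out.println(table.get("b"));  // absent key: filtered out
        System.out.println(table.storeGets); // number of store lookups performed
    }
}
```

A Bloom filter can return false positives but never false negatives, so the guard only
skips work and cannot change join results. How much it helps depends on the fraction of
probe keys that miss the table.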

Copied from

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
