hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Allan Yan (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-11033) BloomFilter index is not honored by ORC reader
Date Wed, 17 Jun 2015 02:00:09 GMT
Allan Yan created HIVE-11033:
--------------------------------

             Summary: BloomFilter index is not honored by ORC reader
                 Key: HIVE-11033
                 URL: https://issues.apache.org/jira/browse/HIVE-11033
             Project: Hive
          Issue Type: Bug
    Affects Versions: 1.2.0
            Reporter: Allan Yan


There is a bug in the org.apache.hadoop.hive.ql.io.orc.ReaderImpl class which caused the bloom
filter index saved in the ORC file not being used. The reason is because the bloomFilterIndices
variable defined in the SargApplier class superseded from its parent class.

Here is one way to fix it
{noformat}
18:46 $ diff src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java.original
174d173
<     bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()];
178c177
<           sarg, options.getColumnNames(), strideRate, types, included.length, bloomFilterIndices);
---
>           sarg, options.getColumnNames(), strideRate, types, included.length);
204a204
>     bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()];
673c673
<         List<OrcProto.Type> types, int includedCount, OrcProto.BloomFilterIndex[]
bloomFilterIndices) {
---
>         List<OrcProto.Type> types, int includedCount) {
677c677
<       this.bloomFilterIndices = bloomFilterIndices;
---
>       bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()];
{noformat}





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message