hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Junjie Chen (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-17261) Hive use deprecated ParquetInputSplit constructor which blocked parquet dictionary filter
Date Mon, 07 Aug 2017 08:32:00 GMT
Junjie Chen created HIVE-17261:
----------------------------------

             Summary: Hive use deprecated ParquetInputSplit constructor which blocked parquet
dictionary filter
                 Key: HIVE-17261
                 URL: https://issues.apache.org/jira/browse/HIVE-17261
             Project: Hive
          Issue Type: Improvement
          Components: Database/Schema
    Affects Versions: 2.2.0
            Reporter: Junjie Chen
            Priority: Minor


Hive use deprecated ParquetInputSplit in [https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/ParquetRecordReaderBase.java#L128]

Please see interface definition in [https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetInputSplit.java#L80]

Old interface set rowgroupoffset values which will lead to skip dictionary filter in parquet.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message