drill-issues mailing list archives

From "Damien Profeta (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-5796) Filter pruning for multi rowgroup parquet file
Date Sat, 16 Sep 2017 01:00:00 GMT
Damien Profeta created DRILL-5796:

             Summary: Filter pruning for multi rowgroup parquet file
                 Key: DRILL-5796
                 URL: https://issues.apache.org/jira/browse/DRILL-5796
             Project: Apache Drill
          Issue Type: Improvement
          Components: Storage - Parquet
            Reporter: Damien Profeta

Today, filter pruning uses the file name as the partitioning key. This means you can prune
a partition only if the whole file belongs to that single partition. With Parquet, pruning
could also work when each row group forms a partition of your dataset, because the unit of
work is the row group, not the file.
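
To illustrate the difference, here is a minimal sketch in Python. It is not Drill's implementation: the `RowGroup` class, the function names, and the sample data are all hypothetical, and it assumes each row group exposes min/max statistics for the partitioning column, as Parquet metadata can. It compares pruning decided per file with pruning decided per row group for an equality filter.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class RowGroup:
    """Hypothetical summary of one Parquet row group for a single column."""
    file: str
    index: int
    min_value: str
    max_value: str


def prune_by_file(row_groups: List[RowGroup], wanted: str) -> List[RowGroup]:
    """File-level pruning: drop a file only if the whole file holds one
    partition value and that value differs from the filter value."""
    by_file: Dict[str, List[RowGroup]] = {}
    for rg in row_groups:
        by_file.setdefault(rg.file, []).append(rg)

    kept: List[RowGroup] = []
    for groups in by_file.values():
        # Collect every min and max seen in the file; one distinct value
        # means the file contains a single partition.
        values = {g.min_value for g in groups} | {g.max_value for g in groups}
        if len(values) == 1 and wanted not in values:
            continue  # whole file is another partition: safe to skip
        kept.extend(groups)
    return kept


def prune_by_row_group(row_groups: List[RowGroup], wanted: str) -> List[RowGroup]:
    """Row-group-level pruning: keep a row group only if the filter value
    falls inside its [min, max] range."""
    return [rg for rg in row_groups if rg.min_value <= wanted <= rg.max_value]


# Hypothetical dataset: a.parquet has one partition per row group,
# b.parquet holds a single partition.
ROW_GROUPS = [
    RowGroup("a.parquet", 0, "2017-01", "2017-01"),
    RowGroup("a.parquet", 1, "2017-02", "2017-02"),
    RowGroup("a.parquet", 2, "2017-03", "2017-03"),
    RowGroup("b.parquet", 0, "2017-04", "2017-04"),
]

if __name__ == "__main__":
    print(len(prune_by_file(ROW_GROUPS, "2017-02")))       # row groups left to scan
    print(len(prune_by_row_group(ROW_GROUPS, "2017-02")))  # row groups left to scan
```

With this sample data, file-level pruning can only discard b.parquet, so all three row groups of a.parquet are still scanned, while row-group-level pruning keeps only the one row group whose range contains the filter value.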

This message was sent by Atlassian JIRA
