drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rahul Challapalli (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3410) Partition Pruning : We are doing a prune when we shouldn't
Date Sat, 27 Jun 2015 00:21:05 GMT
Rahul Challapalli created DRILL-3410:

             Summary: Partition Pruning : We are doing a prune when we shouldn't
                 Key: DRILL-3410
                 URL: https://issues.apache.org/jira/browse/DRILL-3410
             Project: Apache Drill
          Issue Type: Bug
          Components: Query Planning & Optimization
            Reporter: Rahul Challapalli
            Assignee: Jinfeng Ni
            Priority: Critical
             Fix For: 1.1.0


The below plan does not look right. It should scan all the files based on the filters in the
query. Also hive returned more rows than drill
explain plan for select * from `existing_partition_pruning/lineitempart` where (dir0=1993
and columns[0] >29600) or (dir0=1994 or columns[0]>29700);
| 00-00    Screen
00-01      Project(*=[$0])
00-02        Project(T70¦¦*=[$0])
00-03          SelectionVectorRemover
00-04            Filter(condition=[OR(AND(=($1, 1993), >(ITEM($2, 0), 29600)), =($1, 1994),
>(ITEM($2, 0), 29700))])
00-05              Project(T70¦¦*=[$0], dir0=[$1], columns=[$2])
00-06                Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/ctas_auto_partition/existing_partition_pruning/lineitempart/0_0_3.parquet],
ReadEntryWithPath [path=/drill/testdata/ctas_auto_partition/existing_partition_pruning/lineitempart/0_0_4.parquet]],
numFiles=2, columns=[`*`]]])

I attached the data set used. Let me know if you need anything more

This message was sent by Atlassian JIRA

View raw message