drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rahul Challapalli (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-4999) Metadata Cache Pruning not taking place when we have both directory partitioning and auto partitioned parquet files
Date Fri, 04 Nov 2016 04:58:59 GMT
Rahul Challapalli created DRILL-4999:
----------------------------------------

             Summary: Metadata Cache Pruning not taking place when we have both directory
partitioning and auto partitioned parquet files
                 Key: DRILL-4999
                 URL: https://issues.apache.org/jira/browse/DRILL-4999
             Project: Apache Drill
          Issue Type: Bug
          Components: Metadata, Query Planning & Optimization
            Reporter: Rahul Challapalli


git.commit.id.abbrev=17b9648

Metadata cache pruning is not taking place in the below scenario
{code}
0: jdbc:drill:zk=10.10.100.190:5181> explain plan for select distinct i_rec_end_date from
dfs.`/drill/testdata/parquet_date/auto_partition/item_single` where dir0='1.9' and i_rec_start_date
= date '1997-10-27';
+------+------+
| text | json |
+------+------+
| 00-00    Screen
00-01      Project(i_rec_end_date=[$0])
00-02        HashAgg(group=[{0}])
00-03          Project(i_rec_end_date=[$2])
00-04            SelectionVectorRemover
00-05              Filter(condition=[AND(=($0, '1.9'), =($1, 1997-10-27))])
00-06                Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/parquet_date/auto_partition/item_single/1.9/0_0_1.parquet]],
selectionRoot=/drill/testdata/parquet_date/auto_partition/item_single, numFiles=1, usedMetadataFile=true,
cacheFileRoot=/drill/testdata/parquet_date/auto_partition/item_single, columns=[`dir0`, `i_rec_start_date`,
`i_rec_end_date`]]])
{code}

I attached the data set and the log file



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message