drill-dev mailing list archives

From "Rahul Challapalli (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3892) Metadata cache not being leveraged when partition pruning is taking place
Date Sun, 04 Oct 2015 21:15:27 GMT
Rahul Challapalli created DRILL-3892:

             Summary: Metadata cache not being leveraged when partition pruning is taking place
                 Key: DRILL-3892
                 URL: https://issues.apache.org/jira/browse/DRILL-3892
             Project: Apache Drill
          Issue Type: Bug
          Components: Metadata
    Affects Versions: 1.2.0
            Reporter: Rahul Challapalli
            Priority: Critical


As the plan below shows, the metadata cache is not being leveraged even though the cache
file is present: the scan reports usedMetadataFile=false after partition pruning.

0: jdbc:drill:zk=> refresh table metadata dfs.`/drill/testdata/metadata_caching/lineitem_deletecache`;
|  ok   |                                             summary                            
| true  | Successfully updated metadata for table /drill/testdata/metadata_caching/lineitem_deletecache.
1 row selected (0.402 seconds)
0: jdbc:drill:zk=> explain plan for select count(*) from dfs.`/drill/testdata/metadata_caching/lineitem_deletecache`
where dir0=2006 group by l_linestatus;
| text | json |
| 00-00    Screen
00-01      Project(EXPR$0=[$1])
00-02        HashAgg(group=[{0}], EXPR$0=[COUNT()])
00-03          Project(l_linestatus=[$0])
00-04            Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/metadata_caching/lineitem_deletecache/2006/1/lineitem_999.parquet]],
selectionRoot=maprfs:/drill/testdata/metadata_caching/lineitem_deletecache, numFiles=1, usedMetadataFile=false,
columns=[`l_linestatus`, `dir0`]]])
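
For anyone trying to reproduce this, a minimal sketch of checking the two relevant scan attributes in the plan text is given here. It assumes the plan has been captured as a plain string; the helper name scan_attributes is hypothetical and is not part of Drill. It only pattern-matches the numFiles and usedMetadataFile fields that appear in the Scan line above.

```python
import re

# Scan line copied from the EXPLAIN output in this report.
PLAN = (
    "00-04            Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath "
    "[path=maprfs:/drill/testdata/metadata_caching/lineitem_deletecache/2006/1/lineitem_999.parquet]], "
    "selectionRoot=maprfs:/drill/testdata/metadata_caching/lineitem_deletecache, numFiles=1, "
    "usedMetadataFile=false, columns=[`l_linestatus`, `dir0`]]])"
)


def scan_attributes(plan_text):
    """Hypothetical helper: pull numFiles and usedMetadataFile out of plan text.

    Returns None for a field that does not appear in the text.
    """
    num_files = re.search(r"numFiles=(\d+)", plan_text)
    used_cache = re.search(r"usedMetadataFile=(true|false)", plan_text)
    return {
        "numFiles": int(num_files.group(1)) if num_files else None,
        "usedMetadataFile": (used_cache.group(1) == "true") if used_cache else None,
    }


attrs = scan_attributes(PLAN)
print(attrs)
```

With the plan above, numFiles comes back as 1 (pruning to dir0=2006 happened) while usedMetadataFile comes back as False, which is the combination this issue describes.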

I have attached the data set used. Let me know if you need anything more.

This message was sent by Atlassian JIRA
