drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Venkata Jyothsna Donapati (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-7238) Drill does not use DirectScan for non-existent columns
Date Mon, 06 May 2019 20:33:00 GMT

     [ https://issues.apache.org/jira/browse/DRILL-7238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Venkata Jyothsna Donapati updated DRILL-7238:
---------------------------------------------
    Component/s: Metadata

> Drill does not use DirectScan for non-existent columns
> ------------------------------------------------------
>
>                 Key: DRILL-7238
>                 URL: https://issues.apache.org/jira/browse/DRILL-7238
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Metadata
>            Reporter: Venkata Jyothsna Donapati
>            Assignee: Venkata Jyothsna Donapati
>            Priority: Major
>
> This query does not use the summary metadata cache file:
> select count(int_nulls_id), count(int_id), count(ss_ticket_number), count(extra) from
store_sales_null_blocks_int;
> In this query, extra is a column that does not exist (non-existent column).
> Here is the explain plan:
> {noformat}
> | 00-00    Screen
> 00-01      Project(EXPR$0=[$0], EXPR$1=[$1], EXPR$2=[$2], EXPR$3=[$3])
> 00-02        StreamAgg(group=[{}], EXPR$0=[$SUM0($0)], EXPR$1=[$SUM0($1)], EXPR$2=[$SUM0($2)],
EXPR$3=[$SUM0($3)])
> 00-03          UnionExchange
> 01-01            StreamAgg(group=[{}], EXPR$0=[COUNT($0)], EXPR$1=[COUNT($1)], EXPR$2=[COUNT($2)],
EXPR$3=[COUNT($3)])
> 01-02              Scan(table=[[dfs, parquet_metadata_cache, store_sales_null_blocks_int]],
groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/metadata_cache/store_sales_null_blocks_int]],
selectionRoot=/drill/testdata/metadata_cache/store_sales_null_blocks_int, numFiles=1, numRowGroups=11,
usedMetadataFile=true, cacheFileRoot=/drill/testdata/metadata_cache/store_sales_null_blocks_int,
columns=[`int_nulls_id`, `int_id`, `ss_ticket_number`, `extra`]]])
> {noformat}
> This is a regression from Drill 1.15.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message