drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Miroslav Holubec (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-4519) File system directory-based partition pruning doesn't work correctly with parquet metadata
Date Fri, 18 Mar 2016 15:17:33 GMT

     [ https://issues.apache.org/jira/browse/DRILL-4519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Miroslav Holubec updated DRILL-4519:
------------------------------------
    Description: 
We have parquet files in folders with following convention YYYY/MM/DD/HH.
Without drill's parquet metadata directory prunning works seamlessly.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = YYYY,  dir1 = MM, dir2 = DD, dir3 = HH
{noformat}
After creating metadata and executing same query, dir0 contains HH folder name instead yearly
folder name. dir1...3 are null.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = HH,  dir1 = null, dir2 = null, dir3 = null
{noformat}



  was:
We have parquet files in folders with following convention YYYY/MM/DD/HH.
Without drill's parquet metadata directory prunning works seamlessly.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = YYYY,  dir1 = MM, dir2 = DD, dir3 = HH
{noformat}
After creating metadata and executing same query, dir0 contains HH folder name instead yearly
folder name. dir1...4 are null.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = HH,  dir1 = null, dir2 = null, dir3 = null
{noformat}




> File system directory-based partition pruning doesn't work correctly with parquet metadata
> ------------------------------------------------------------------------------------------
>
>                 Key: DRILL-4519
>                 URL: https://issues.apache.org/jira/browse/DRILL-4519
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 1.4.0, 1.5.0
>            Reporter: Miroslav Holubec
>
> We have parquet files in folders with following convention YYYY/MM/DD/HH.
> Without drill's parquet metadata directory prunning works seamlessly.
> {noformat}
> select dir0, dir1, dir2 from hdfs.test.indexed;
> dir0 = YYYY,  dir1 = MM, dir2 = DD, dir3 = HH
> {noformat}
> After creating metadata and executing same query, dir0 contains HH folder name instead
yearly folder name. dir1...3 are null.
> {noformat}
> select dir0, dir1, dir2 from hdfs.test.indexed;
> dir0 = HH,  dir1 = null, dir2 = null, dir3 = null
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message