drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jerry Ylilammi (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-4193) All Parquet columns are loaded when querying S3
Date Mon, 14 Dec 2015 09:20:46 GMT
Jerry Ylilammi created DRILL-4193:
-------------------------------------

             Summary: All Parquet columns are loaded when querying S3
                 Key: DRILL-4193
                 URL: https://issues.apache.org/jira/browse/DRILL-4193
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
    Affects Versions: 1.3.0
            Reporter: Jerry Ylilammi


Drill starts downloading all data from S3 not making use of Parquet being columnar format
and only loading required column.

Query:
{code}SELECT DISTINCT data.measurement.cid 
FROM mys3bucket.`test/datatable` AS data;{code}

Parquet file:
{code}...
measurement:
.cid:                  INT64 GZIP DO:0 FPO:9380624 SZ:110/76/0.69 VC:32327 ENC:BIT_PACKED,PLAIN_DICTIONARY,RLE
...{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message