drill-issues mailing list archives

From "Jerry Ylilammi (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-4193) All Parquet columns are loaded when querying S3
Date Mon, 14 Dec 2015 09:20:46 GMT
Jerry Ylilammi created DRILL-4193:

             Summary: All Parquet columns are loaded when querying S3
                 Key: DRILL-4193
                 URL: https://issues.apache.org/jira/browse/DRILL-4193
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
    Affects Versions: 1.3.0
            Reporter: Jerry Ylilammi

When querying Parquet data stored on S3, Drill starts downloading all of the data instead of
taking advantage of Parquet's columnar format and loading only the required column.

{code}SELECT DISTINCT data.measurement.cid 
FROM mys3bucket.`test/datatable` AS data;{code}

Parquet file metadata for the queried column:
.cid:                  INT64 GZIP DO:0 FPO:9380624 SZ:110/76/0.69 VC:32327 ENC:BIT_PACKED,PLAIN_DICTIONARY,RLE
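
For scale, here is a rough sketch of how little data a column-pruning reader would need for this query. It is not Drill's reader code. It assumes the line above follows the parquet-tools convention (DO = dictionary page offset, FPO = first data page offset, SZ = compressed/uncompressed/ratio, VC = value count) and that the chunk starts at DO when it is non-zero, otherwise at FPO. The helper names parse_column_chunk and range_header are made up for illustration.

```python
import re

# Column-chunk summary line exactly as quoted in the report.
META_LINE = (".cid:                  INT64 GZIP DO:0 FPO:9380624 "
             "SZ:110/76/0.69 VC:32327 ENC:BIT_PACKED,PLAIN_DICTIONARY,RLE")

# Assumed field layout: name, type, codec, DO, FPO, SZ, VC, ENC.
PATTERN = re.compile(
    r"^\.?(?P<name>[\w.]+):\s+(?P<type>\w+)\s+(?P<codec>\w+)\s+"
    r"DO:(?P<dict_offset>\d+)\s+FPO:(?P<first_page_offset>\d+)\s+"
    r"SZ:(?P<compressed>\d+)/(?P<uncompressed>\d+)/(?P<ratio>[\d.]+)\s+"
    r"VC:(?P<value_count>\d+)\s+ENC:(?P<encodings>[\w,]+)$"
)


def parse_column_chunk(line):
    """Parse one column-chunk summary line into a dict (hypothetical helper)."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError("unrecognised column-chunk line: %r" % line)
    fields = match.groupdict()
    for key in ("dict_offset", "first_page_offset", "compressed",
                "uncompressed", "value_count"):
        fields[key] = int(fields[key])
    fields["ratio"] = float(fields["ratio"])
    fields["encodings"] = fields["encodings"].split(",")
    return fields


def range_header(chunk):
    """Build the HTTP Range value covering only this column chunk.

    Assumption: the chunk begins at the dictionary page when DO is non-zero,
    otherwise at the first data page, and spans `compressed` bytes.
    """
    start = chunk["dict_offset"] or chunk["first_page_offset"]
    end = start + chunk["compressed"] - 1  # HTTP byte ranges are inclusive
    return "bytes=%d-%d" % (start, end)


chunk = parse_column_chunk(META_LINE)
print(chunk["name"], chunk["compressed"], "compressed bytes")
print("Range:", range_header(chunk))
```

Under these assumptions the cid column chunk is 110 compressed bytes, so a ranged GET against S3 for that chunk (plus the file footer) should be enough for the query, rather than a download of the whole file.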

This message was sent by Atlassian JIRA
