drill-issues mailing list archives

From "Khurram Faraaz (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-5674) Drill should support .zip compression
Date Tue, 18 Jul 2017 11:54:01 GMT

     [ https://issues.apache.org/jira/browse/DRILL-5674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Khurram Faraaz updated DRILL-5674:
----------------------------------
    Component/s: Storage - Text & CSV

> Drill should support .zip compression
> -------------------------------------
>
>                 Key: DRILL-5674
>                 URL: https://issues.apache.org/jira/browse/DRILL-5674
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Text & CSV
>    Affects Versions: 1.10.0
>            Reporter: Paul Rogers
>
> Zip is a very common compression format. Create a compressed CSV file with column headers: data.csv.zip.
> Define a storage plugin config for the file, call it "dfs.myws", and set delimiter = ",", extract header = true, skip header = false.
> Run a simple query:
> SELECT * FROM dfs.myws.`data.csv.zip`
> The result is garbage because the CSV reader tries to parse the zipped data as if it were text.
> DRILL-5506 asks how to do this; the responder said to add a library to the path. It would be better to support zip out of the box as a default format.
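
The symptom described above can be reproduced outside Drill. The following is a minimal Python sketch, not Drill code: it builds a small data.csv.zip in memory, shows the leading bytes a plain-text reader would encounter, and then reads the same archive the way a zip-aware reader might. The column names and row values are made up for illustration; only the file name data.csv.zip comes from the report.

```python
import csv
import io
import zipfile

# Hypothetical sample data; the issue does not specify the file contents.
csv_text = "id,name\n1,alpha\n2,beta\n"

# Build data.csv.zip in memory: one CSV member with a header row.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
    zf.writestr("data.csv", csv_text)
raw = buf.getvalue()

# What a text reader sees when it parses the archive bytes directly:
# the zip local file header signature rather than CSV text.
leading_bytes = raw[:4]

# What a zip-aware reader would do instead: open the member first,
# then parse the decompressed stream as CSV with a header row.
with zipfile.ZipFile(io.BytesIO(raw)) as zf:
    with zf.open("data.csv") as member:
        text = io.TextIOWrapper(member, encoding="utf-8", newline="")
        rows = list(csv.DictReader(text))

print(leading_bytes)
print(rows)
```

The archive begins with the bytes `PK\x03\x04`, which is why a reader that treats the file as delimited text returns garbage. Decompressing the member before parsing yields the expected header and rows.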



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
