drill-dev mailing list archives

From "Roger Dielrton (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-4659) Specify, as part of the query, table information: data format (CSV, parquet, JSON, etc.), field delimiter, etc.
Date Mon, 09 May 2016 16:17:12 GMT
Roger Dielrton created DRILL-4659:
-------------------------------------

             Summary: Specify, as part of the query, table information: data format (CSV,
parquet, JSON, etc.), field delimiter, etc.
                 Key: DRILL-4659
                 URL: https://issues.apache.org/jira/browse/DRILL-4659
             Project: Apache Drill
          Issue Type: Improvement
          Components: Query Planning & Optimization, SQL Parser
            Reporter: Roger Dielrton
            Priority: Minor


I have a file that I would like to use in a query, and it can have one or more of the
following properties:
* Has no extension ==> Drill is unable to handle it.
* I know it contains data in CSV format, but with a non-standard character as field separator
==>
Drill is unable to parse it (without modifying the storage plugin configuration).
* Is located in an Amazon S3 bucket ==> I cannot rename it.
* Is very large ==> It would be expensive to make a copy of it.

It would be nice to be able to specify, as part of the "select" query, relevant table
metadata such as:
* Data format (CSV, Parquet, JSON, etc.)
* Field delimiter
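
To illustrate the idea, here is a minimal sketch in plain Python (not Drill code) of a
reader that takes the data format and field delimiter from the caller instead of inferring
them from the file name or from a plugin configuration. The function name read_table and
its parameters data_format and field_delimiter are hypothetical and exist only for this
illustration; they are not part of any Drill API.

```python
import csv
import io
import json


def read_table(stream, data_format, field_delimiter=","):
    """Parse rows using caller-supplied metadata; no file name or extension is consulted."""
    if data_format == "csv":
        # The delimiter comes from the caller, not from a global configuration.
        return [row for row in csv.reader(stream, delimiter=field_delimiter)]
    if data_format == "json":
        # Assumption for this sketch: one JSON object per non-empty line.
        return [json.loads(line) for line in stream if line.strip()]
    raise ValueError(f"unsupported data format: {data_format}")


# An in-memory stand-in for an extensionless file whose fields are separated by '|'.
sample = io.StringIO("id|name\n1|alice\n2|bob\n")
rows = read_table(sample, data_format="csv", field_delimiter="|")
print(rows)
```

The point of the sketch is that the format and delimiter travel with each read request, so
the file itself does not need to be renamed or copied.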



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
