drill-issues mailing list archives

From "Paul Rogers (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (DRILL-5492) CSV reader does not validate header names, causes nonsense output
Date Thu, 11 May 2017 00:41:04 GMT

     [ https://issues.apache.org/jira/browse/DRILL-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paul Rogers reassigned DRILL-5492:
----------------------------------

    Assignee: Paul Rogers

> CSV reader does not validate header names, causes nonsense output
> -----------------------------------------------------------------
>
>                 Key: DRILL-5492
>                 URL: https://issues.apache.org/jira/browse/DRILL-5492
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Paul Rogers
>            Assignee: Paul Rogers
>            Priority: Minor
>
> Consider the same test case as in DRILL-5491, but with a slightly different input file:
> {code}
> ___
> a,b,c
> d,e,f
> {code}
> The underscores represent three spaces: use spaces in the real test.
> In this case, the code discussed in DRILL-5491 finds some characters and happily returns the following array:
> {code}
> ["   "]
> {code}
> The field name of three blanks is returned to the client to produce the following bizarre output:
> {code}
> 2 row(s):
>     
> a
> d
> {code}
> The blank line is where the header normally appears, but the header here was considered to be three blanks. (In fact, the blanks are actually printed.)
> Since the blanks were considered to be a field name, the file is assumed to have only one field, so only the first column was returned.
> The expected behavior is that spaces are trimmed from field names, so the field name list would be empty and a User Error would be thrown. (That is, it is confusing to the user why some blank lines produce an NPE, some produce the {{ExecutionSetupException}} shown in DRILL-5491, and some produce blank headings. Behavior should be consistent.)
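
The expected behavior described above could be sketched roughly as follows. This is only an illustration, not Drill's actual reader code: the class {{HeaderValidator}} and the method {{parseHeader}} are hypothetical names, {{IllegalArgumentException}} stands in for whatever user error Drill would raise, and the sketch assumes a plain comma delimiter with no quoted fields.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of the Drill code base.
class HeaderValidator {

    // Splits a CSV header line on commas, trims each name, and rejects
    // any name that is empty once the surrounding spaces are removed.
    static List<String> parseHeader(String headerLine) {
        // A limit of -1 keeps trailing empty fields, so "a,b," yields three entries.
        String[] raw = headerLine.split(",", -1);
        List<String> names = new ArrayList<>();
        for (int i = 0; i < raw.length; i++) {
            String name = raw[i].trim();
            if (name.isEmpty()) {
                // Stand-in for a user-facing error (an assumption of this sketch).
                throw new IllegalArgumentException(
                    "CSV header field " + (i + 1) + " is blank");
            }
            names.add(name);
        }
        return names;
    }
}
```

With this approach a header line of three spaces is rejected up front instead of becoming a single field named with three blanks, and an entirely empty header line fails the same way, which gives the consistent behavior asked for above.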



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
