drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paul Rogers (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-5492) CSV with spaces for header uses spaces as field name
Date Tue, 09 May 2017 05:13:04 GMT
Paul Rogers created DRILL-5492:

             Summary: CSV with spaces for header uses spaces as field name
                 Key: DRILL-5492
                 URL: https://issues.apache.org/jira/browse/DRILL-5492
             Project: Apache Drill
          Issue Type: Bug
            Reporter: Paul Rogers
            Priority: Minor

Consider the same test case as in DRILL-5491, but with a slightly different input file:


The underscores represent three spaces: use spaces in the real test.

In this case, the code discussed in DRILL-5491 finds some characters and happily returns the
following array:

["   "]

The field name of three blanks is returned to the client to produce the following bizarre

2 row(s):

The blank line is normally the header, but the header here was considered to be three blanks.
(In fact, the blanks are actually printed.)

Since the blanks were considered to be a field, the file is assumed to have only one field,
so only the first column was returned.

The expected behavior is that spaces are trimmed from field names, so the field name list
would be empty and a User Error thrown. (That is, it is confusing to the user why a blank
line produces NPE, some produce the {{ExecutionSetupException}} shown in DRILL-5491, and some
produce blank headings. Behavior should be consistent.

This message was sent by Atlassian JIRA

View raw message