drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paul Rogers (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-5498) CSV text reader does not properly handle duplicate header names
Date Wed, 10 May 2017 03:43:04 GMT
Paul Rogers created DRILL-5498:
----------------------------------

             Summary: CSV text reader does not properly handle duplicate header names
                 Key: DRILL-5498
                 URL: https://issues.apache.org/jira/browse/DRILL-5498
             Project: Apache Drill
          Issue Type: Bug
    Affects Versions: 1.8.0
            Reporter: Paul Rogers
            Priority: Minor


Consider the following CSV file:

{code}
h,h,h
a,b,c
d,e,f
{code}

Parse this with the CSV storage plugins to parse headers. The result:

{code}
2 row(s):
h
c
f
{code}

Expected a runtime error for the duplicate column names, or automatic "uniqification" of the
names. Certainly did not expect the first two columns to be dropped.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message