asterixdb-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Murtadha Makki Al Hubail (Jira)" <j...@apache.org>
Subject [jira] [Assigned] (ASTERIXDB-2710) Incorrect CSV quote handling
Date Thu, 16 Apr 2020 13:07:00 GMT

     [ https://issues.apache.org/jira/browse/ASTERIXDB-2710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Murtadha Makki Al Hubail reassigned ASTERIXDB-2710:
---------------------------------------------------

    Assignee: Ali Alsuliman  (was: Murtadha Makki Al Hubail)

> Incorrect CSV quote handling
> ----------------------------
>
>                 Key: ASTERIXDB-2710
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2710
>             Project: Apache AsterixDB
>          Issue Type: Bug
>            Reporter: Ian Maxon
>            Assignee: Ali Alsuliman
>            Priority: Major
>
> the delimited text parser chokes on this quoted input for some reason. to reproduce use
this ddl:
> create type foo as closed{ messageid: string, userid: string, message: string, updated_time:
string, nchar: string }; 
> create dataset foods(foo) primary key messageid;
> load dataset foods  using localfs
>     (("path"="127.0.0.1:///home/parshimers/status_synth.csv"),
>     ("format"="delimited-text"),
>     ("quote"="\""));
> where status_synth.csv contains:
> 1,foo,",       bar",baz,1
> 2,foo,"bar",baz,2
> will  error out with:
> At record: 1, field#: 4 - a quote enclosing a field needs to be placed in the beginning
of that field. [IOException]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message