carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "xuchuanyin (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (CARBONDATA-2110) option of TempCsv should be removed since the default delimiter may conflicts with field value
Date Thu, 01 Feb 2018 08:03:00 GMT

     [ https://issues.apache.org/jira/browse/CARBONDATA-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

xuchuanyin reassigned CARBONDATA-2110:
--------------------------------------

    Assignee: xuchuanyin

> option of TempCsv should be removed since the default delimiter may conflicts with field
value
> ----------------------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-2110
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2110
>             Project: CarbonData
>          Issue Type: Bug
>          Components: data-load
>            Reporter: xuchuanyin
>            Assignee: xuchuanyin
>            Priority: Major
>
> Currently in carbondata, an option named ‘tempCSV’ is available during loading dataframe.
>  
> After enabling this option, Carbondata will write the dataframe to a *standard* csv
file at first and then load the data files.
>  
> The delimiters of the standard csv file, such as field delimiter / escape char/ quote
char/ multi-line/ line separator and so on may conflict with the actual field value. For example,
if a field contains ',', then it will cause problem in further data loading if we save the
tempCSV using ',' as field separator.
>  
> Since we are not sure about the content of dataframe, I think it's better to deprecate
this option. To make forward compatible, user can still use this option but will get warning
about it.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message