carbondata-issues mailing list archives

From "jiangmanhua (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (CARBONDATA-2017) Error occurs when loading multiple files
Date Fri, 19 Jan 2018 12:34:00 GMT

     [ https://issues.apache.org/jira/browse/CARBONDATA-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

jiangmanhua reassigned CARBONDATA-2017:
---------------------------------------

    Assignee: jiangmanhua

> Error occurs when loading multiple files
> ----------------------------------------
>
>                 Key: CARBONDATA-2017
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2017
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: jiangmanhua
>            Assignee: jiangmanhua
>            Priority: Minor
>             Fix For: 1.3.0
>
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Problem:
> Carbon supports loading from multiple file paths at once, but we find that Carbon will
> throw an exception like "The input file does not exist" when loading multiple files on HDFS.
> For example:
> ex1: LOAD DATA INPATH '/data/source.csv,/data/source2.csv' INTO TABLE test_table
> ex2: LOAD DATA INPATH 'hdfs://ha/data/source.csv,hdfs://ha/data/source2.csv' INTO TABLE test_table
> ex1 will throw an exception saying that source2.csv does not exist.
> ex2 will execute normally.
> Solution:
> We found that Carbon treats PATH as a whole and checks its prefix before splitting
> it into multiple paths. The problem is solved by checking the prefix of each path
> after splitting PATH into multiple paths.
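
The difference between the two orderings in the description above can be sketched as follows. This is an illustrative Python model, not CarbonData's actual code: the function names are hypothetical, the default filesystem prefix `hdfs://ha` is borrowed from ex2 as an assumption, and "checking the prefix" is assumed to mean prepending the default filesystem when a path carries no scheme.

```python
# Assumed default filesystem prefix (taken from ex2 in the description).
DEFAULT_FS = "hdfs://ha"


def qualify(path, default_fs=DEFAULT_FS):
    """Prepend the default filesystem if the string has no scheme."""
    path = path.strip()
    if "://" in path:
        return path
    return default_fs.rstrip("/") + "/" + path.lstrip("/")


def qualify_whole_then_split(inpath, default_fs=DEFAULT_FS):
    """Ordering described as the bug: prefix check on the whole string, then split."""
    return qualify(inpath, default_fs).split(",")


def split_then_qualify(inpath, default_fs=DEFAULT_FS):
    """Ordering described as the fix: split first, then prefix check per path."""
    return [qualify(p, default_fs) for p in inpath.split(",") if p.strip()]


if __name__ == "__main__":
    ex1 = "/data/source.csv,/data/source2.csv"
    print(qualify_whole_then_split(ex1))  # second path stays unqualified
    print(split_then_qualify(ex1))        # both paths are qualified
```

Under these assumptions, the first ordering only qualifies the first path of ex1, which is consistent with the report that source2.csv is the file reported as missing, while ex2 passes either way because every path already carries a scheme.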



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
