carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jiangmanhua (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CARBONDATA-2017) Error occurs when loading multiple files
Date Wed, 10 Jan 2018 14:05:00 GMT
jiangmanhua created CARBONDATA-2017:
---------------------------------------

             Summary: Error occurs when loading multiple files
                 Key: CARBONDATA-2017
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2017
             Project: CarbonData
          Issue Type: Bug
            Reporter: jiangmanhua
            Priority: Minor


Problem:
Carbon supports loading from multiple file paths at once, but we find that Carbon will throw
an exception like "The input file does not exist" when loading multiple files on HDFS.

For example:
ex1: LOAD DATA INPATH '/data/source.csv,/data/source2.csv' INTO TABLE test_table
ex2: LOAD DATA INPATH 'hdfs://ha/data/source.csv,hdfs://ha/data/source2.csv' INTO TABLE test_table

ex1 will throw an exception saying that source2.csv does not exist.
ex2 will execute normally.


Solution:
We found that carbon takes the PATH as a whole and checks its prefix before spliting it into
multiplt paths. So the problem will be solved when we do the prefix checking job for each
path after spliting PATH into multiplt paths. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message