carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "WilliamZhu (JIRA)" <>
Subject [jira] [Created] (CARBONDATA-465) Spark streaming dataframe support
Date Tue, 29 Nov 2016 07:54:59 GMT
WilliamZhu created CARBONDATA-465:

             Summary: Spark streaming dataframe support
                 Key: CARBONDATA-465
             Project: CarbonData
          Issue Type: Improvement
          Components: data-load
    Affects Versions: 0.3.0-incubating
            Reporter: WilliamZhu
            Assignee: WilliamZhu
            Priority: Minor
             Fix For: 0.3.0-incubating

Carbondata-0.3.0 support load data with spark data frame api. There is a limit that kettle
is still required since DataFrameLoaderRDD still depends on kettle. We provide NewDataFrameLoaderRDD
 to load data with new flow .

Also,we discovered some bugs:

1. CarbonMetastoreCatalog.createTableFromThrift

     * schemaFilePath starts with file:// will not create meta files successfully
     * while thriftWriter will have no complains.
     * This will cause some weired error eg. No table found.
    val thriftWriter = new ThriftWriter(schemaFilePath, false)

2. There are some exceptions raised  even when you have set useKettle to false.

This message was sent by Atlassian JIRA

View raw message