carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From ravipesala <>
Subject [GitHub] carbondata pull request #1189: [WIP] Insert overwrite support and force clea...
Date Thu, 20 Jul 2017 10:32:13 GMT
GitHub user ravipesala opened a pull request:

    [WIP] Insert overwrite support and force clean up files and clean up in progress files
support added

    The following features are added in this PR.
    1. Added support for `LOAD OVERWRITE` and `INSERT OVERWRITE` in carbon load. So after
user issues overwrite command all old data will be overwritten with new data.
     Example :
    LOAD DATA INPATH '" data.csv' overwrite INTO table carbontable
    insert overwrite table carbontable select * from othertable
    When overwrite is in progress no other load will be allowed . And if any other load is
already in progress also will be overwritten
    2. Added support for force clean table to remove the table with force from disk. It is
useful in case of inconsistency with hive metastore. This support is only internal purpose
and not exposed to user, so it is supported through scala API not through SQL.
    3.  Cleanup the inprogress files while driver is initializing. In case of driver is down
while any load is in progress then it must be cleaned while coming up of driver. This is only
controlled through parameter `spark.carbon.table.loader.driver` , so it must be set true in
driver properties to cleanup the inprogress files. 

You can merge this pull request into a Git repository by running:

    $ git pull insert-overwrite

Alternatively you can review and apply these changes as the patch at:

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1189
commit 1eca780ee69b07cdf2a86df1759dfaa7d0f96fd8
Author: Ravindra Pesala <>
Date:   2017-07-20T09:27:21Z

    Insert overwrite support and force clean up files and clean up in progress files support


If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at or file a JIRA ticket
with INFRA.

View raw message