carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Liang Chen (JIRA)" <>
Subject [jira] [Updated] (CARBONDATA-440) Provide Update/Delete functionality support in CarbonData
Date Fri, 20 Jan 2017 04:05:26 GMT


Liang Chen updated CARBONDATA-440:
    Affects Version/s:     (was: 0.2.0-incubating)
                           (was: 0.1.0-incubating)

> Provide Update/Delete functionality support in CarbonData
> ---------------------------------------------------------
>                 Key: CARBONDATA-440
>                 URL:
>             Project: CarbonData
>          Issue Type: New Feature
>          Components: core, data-query, file-format, spark-integration, sql
>    Affects Versions: 0.1.1-incubating
>            Reporter: Aniket Adnaik
>             Fix For: 1.0.0-incubating
>         Attachments: ApacheCarbonData_UpdateDelete_Support_jira440.pdf
>          Time Spent: 13h
>  Remaining Estimate: 0h
> Currently, CarbonData does not support modification of existing rows in the table. This
is a major limitation for many real world desirable use cases in data warehousing, such as
slow changing dimension tables, data correction of fact tables or data cleanup, etc. Many
users want to be able to update and delete rows from the CarbonData table.
> Following are some high level design goals to support this functionality,
> 1. Provide a standard SQL interface to perform Update and Delete operations.
> 2. Perform Update and Delete operations on CarbonData table without having to rewrite
the entire CarbonData block (file) by making use of differential files (a.k.a delta files).
> 3. After Update or Delete operation, CarbonData readers should skip deleted records and
read updated records seamlessly without having to modify user applications.
> 4. Perform compaction of Update and Delete delta files to maintain adequate query performance

This message was sent by Atlassian JIRA

View raw message