hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (Jira)" <>
Subject [jira] [Commented] (HUDI-2208) [SQL] Support Bulk Insert For Spark Sql
Date Sat, 07 Aug 2021 06:29:00 GMT


ASF GitHub Bot commented on HUDI-2208:

pengzhiwei2018 edited a comment on pull request #3328:

   Hi @nsivabalan , The PR has updated with the follow chanes:
   add "upsert" mode for insert.mode. Currently we have 3 insert mode:
      - upsert
        In upsert mode for insert into, duplicate record on primary key will be updated.This
is the default insert mode for pk-table.
      - strict
        In strict mode for insert into, we do the pk uniqueness guarantee for COW pk-table.
For MOR pk-table, it has the same behavior with "upsert" mode.
     - non-strict
      In non-strict mode for insert into, we use insert operation to write data which allow
writing the duplicate record.
   The default insert mode is `upsert`.

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail:

For queries about this service, please contact Infrastructure at:

> [SQL] Support Bulk Insert For Spark Sql
> ---------------------------------------
>                 Key: HUDI-2208
>                 URL:
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: pengzhiwei
>            Assignee: pengzhiwei
>            Priority: Blocker
>              Labels: pull-request-available, release-blocker
>             Fix For: 0.9.0
> Support the bulk insert for spark sql

This message was sent by Atlassian Jira

View raw message