Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 12E75200497 for ; Wed, 23 Aug 2017 23:52:06 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 0FC92169C6F; Wed, 23 Aug 2017 21:52:06 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 54FC8169C6E for ; Wed, 23 Aug 2017 23:52:05 +0200 (CEST) Received: (qmail 14976 invoked by uid 500); 23 Aug 2017 21:52:04 -0000 Mailing-List: contact issues-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list issues@hive.apache.org Received: (qmail 14967 invoked by uid 99); 23 Aug 2017 21:52:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Aug 2017 21:52:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id F2040180491 for ; Wed, 23 Aug 2017 21:52:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id pL3vBXiiIreX for ; Wed, 23 Aug 2017 21:52:03 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 79D1861B5E for ; Wed, 23 Aug 2017 21:52:02 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 915D6E0534 for ; Wed, 23 Aug 2017 21:52:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 8EC7F25382 for ; Wed, 23 Aug 2017 21:52:00 +0000 (UTC) Date: Wed, 23 Aug 2017 21:52:00 +0000 (UTC) From: "Wei Zheng (JIRA)" To: issues@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-17361) Support LOAD DATA for transactional tables MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 23 Aug 2017 21:52:06 -0000 [ https://issues.apache.org/jira/browse/HIVE-17361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16139203#comment-16139203 ] Wei Zheng commented on HIVE-17361: ---------------------------------- {code} LOAD DATA [LOCAL] INPATH 'filepath' [OVERWRITE] INTO TABLE tablename [PARTITION (partcol1=val1, partcol2=val2 ...)] {code} Unlike non-ACID table, if the table is bucketed and there are more than 1 bucket file, then LOAD DATA on ACID table will require 'filepath' to refer to a directory not a file. Otherwise, one may end up having a bucket file in one load_delta directory and another bucket file in a different load_delta directory. The reason behind this is: a) For a non-ACID table, say tbl1, one can continue loading files into the same table via consecutive LOAD commands, that will just result more and more files under tbl1/ directory b) However, for a non-ACID table, since a new load_delta directory will be created every time when LOAD DATA is run, consecutive LOAD commands will create separate subdirectories for every single file, which may not be desirable, e.g. if one wants to load a file for one bucket, and then a file for another bucket, those two files will reside in two different load_delta directories. > Support LOAD DATA for transactional tables > ------------------------------------------ > > Key: HIVE-17361 > URL: https://issues.apache.org/jira/browse/HIVE-17361 > Project: Hive > Issue Type: Bug > Components: Transactions > Reporter: Wei Zheng > Assignee: Wei Zheng > Attachments: HIVE-17361.1.patch > > > LOAD DATA was not supported since ACID was introduced. Need to fill this gap between ACID table and regular hive table. -- This message was sent by Atlassian JIRA (v6.4.14#64029)