Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 1CC5A200CFA for ; Tue, 5 Sep 2017 09:28:11 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 1B67516568E; Tue, 5 Sep 2017 07:28:11 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 6180D16568D for ; Tue, 5 Sep 2017 09:28:10 +0200 (CEST) Received: (qmail 31202 invoked by uid 500); 5 Sep 2017 07:28:08 -0000 Mailing-List: contact issues-help@carbondata.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@carbondata.apache.org Delivered-To: mailing list issues@carbondata.apache.org Received: (qmail 31193 invoked by uid 99); 5 Sep 2017 07:28:08 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Sep 2017 07:28:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id F3B671A5337 for ; Tue, 5 Sep 2017 07:28:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id uqtDdum6JeKh for ; Tue, 5 Sep 2017 07:28:02 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 4A97C60D28 for ; Tue, 5 Sep 2017 07:28:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 7E0BEE0E4F for ; Tue, 5 Sep 2017 07:28:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 3524E2414B for ; Tue, 5 Sep 2017 07:28:00 +0000 (UTC) Date: Tue, 5 Sep 2017 07:28:00 +0000 (UTC) From: "Cao, Lionel (JIRA)" To: issues@carbondata.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CARBONDATA-1427) After Splitting Partition, Data doesn't get Divided to Different Partitions. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 05 Sep 2017 07:28:11 -0000 [ https://issues.apache.org/jira/browse/CARBONDATA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16153209#comment-16153209 ] Cao, Lionel commented on CARBONDATA-1427: ----------------------------------------- Was the alter table split statement successful executed or any exceptions? > After Splitting Partition, Data doesn't get Divided to Different Partitions. > ---------------------------------------------------------------------------- > > Key: CARBONDATA-1427 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1427 > Project: CarbonData > Issue Type: Bug > Components: data-query > Affects Versions: 1.2.0 > Environment: spark 2.1 > Reporter: Neha Bhardwaj > Assignee: Pallavi Singh > Priority: Minor > Attachments: list_partition_table.csv > > > When Performing a Split Partition Query on a Partitioned Table, The data doesn't get affected at all, however, we can see the updated Partitions using the show Partitions Query and the old partition as deleted. > But the data still remains in that partition, Ideally, the data should be divided as per the new partitions, Which happens after the subsequent loads, the data then gets to the latest partitions. > Example : > 1. Create Table : > DROP TABLE IF EXISTS list_partition_table; > CREATE TABLE list_partition_table(shortField SHORT, intField INT, bigintField LONG, doubleField DOUBLE, timestampField TIMESTAMP, decimalField DECIMAL(18,2), dateField DATE, charField CHAR(5), floatField FLOAT, complexData ARRAY ) PARTITIONED BY (stringField STRING) STORED BY 'carbondata' TBLPROPERTIES('PARTITION_TYPE'='LIST', 'LIST_INFO'='Asia, (China, Europe, NoPartition)'); > 2. Load Data : > load data inpath 'hdfs://localhost:54310/CSV/list_partition_table.csv' into table list_partition_table options('FILEHEADER'='shortfield,intfield,bigintfield,doublefield,stringfield,timestampfield,decimalfield,datefield,charfield,floatfield,complexdata', 'COMPLEX_DELIMITER_LEVEL_1'='$','COMPLEX_DELIMITER_LEVEL_2'='#'); > 3. Show Partitions : > show partitions list_partition_table; > +----------------------------------------------+--+ > | partition | > +----------------------------------------------+--+ > | 0, stringfield = DEFAULT | > | 1, stringfield = Asia | > | 2, stringfield = China, Europe, NoPartition | > +----------------------------------------------+--+ > 3 rows selected (0.09 seconds) > 4. Split Partition : > ALTER TABLE list_partition_table SPLIT PARTITION(2) INTO('China', '(Europe, NoPartition)' ); > 5. Show Partition : > show partitions list_partition_table; > +---------------------------------------+--+ > | partition | > +---------------------------------------+--+ > | 0, stringfield = DEFAULT | > | 1, stringfield = Asia | > | 3, stringfield = China | > | 4, stringfield = Europe, NoPartition | > +---------------------------------------+--+ > 4 rows selected (0.065 seconds) > The partitions get updated , but still the data remains the same(UNPARTITIONED), in the same partition. -- This message was sent by Atlassian JIRA (v6.4.14#64029)