hive-issues mailing list archives

From "Chaozhong Yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-17460) `insert overwrite` should support table schema evolution (e.g. add columns)
Date Wed, 06 Sep 2017 17:58:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-17460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16155795#comment-16155795 ]

Chaozhong Yang commented on HIVE-17460:
---------------------------------------

[~wzheng] The reason so many (23) tests failed is that almost every schema_evol_*.q.out
differs from the expected output, e.g. https://builds.apache.org/job/PreCommit-HIVE-Build/6693/testReport/junit/org.apache.hadoop.hive.cli/TestMiniLlapLocalCliDriver/testCliDriver_schema_evol_text_nonvec_part_/

```
125c125
< 2 1 2222 new 3333
---
> 2 1 2222 new NULL
138c138
< 2 1 3333
---
> 2 1 NULL
254c254
< 2 1 2222 new 3333
---
> 2 1 2222 new NULL
267c267
```

Obviously, my output is right. However, we should not ignore those failures. Any suggestions?

> `insert overwrite` should support table schema evolution (e.g. add columns)
> ---------------------------------------------------------------------------
>
>                 Key: HIVE-17460
>                 URL: https://issues.apache.org/jira/browse/HIVE-17460
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 2.1.0, 2.2.0
>            Reporter: Chaozhong Yang
>            Assignee: Chaozhong Yang
>             Fix For: 3.0.0
>
>         Attachments: HIVE-17460.patch
>
>
> In Hive, adding columns to an existing table is a common use case. However, if we insert
> overwrite older partitions after adding columns, the added columns cannot be read (they come back as NULL).
> ```
> create table src_table(
>         i int
> )
> PARTITIONED BY (`date` string);
> insert overwrite table src_table partition(`date`='20170905') values (3);
> select * from src_table where `date` = '20170905';
> alter table src_table add columns (bi bigint);
> insert overwrite table src_table partition(`date`='20170905') values (3, 5);
> select * from src_table where `date` = '20170905';
> ```
> The result will be as follows:
> ```
> 3, NULL, '20170905'
> ```
> Obviously, it doesn't meet our expectation. The expected result should be:
> ```
> 3, 5, '20170905'
> ```
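
For comparison, here is a hedged variant of the reproduction above. It is not part of the original report or the attached patch. It assumes the NULL comes from the existing partition keeping its old column list, and it uses the `CASCADE` option of `ALTER TABLE ... ADD COLUMNS` so that the new column is also applied to existing partition metadata. Whether this avoids the problem on a given Hive version would need to be checked.
```
-- Sketch only: same table and data as the reproduction above;
-- the CASCADE keyword is the one change and is an assumption, not the reporter's fix.
create table src_table(
        i int
)
PARTITIONED BY (`date` string);
insert overwrite table src_table partition(`date`='20170905') values (3);
-- CASCADE applies the column change to the metadata of existing partitions as well;
-- the default (RESTRICT) changes only the table-level metadata.
alter table src_table add columns (bi bigint) cascade;
insert overwrite table src_table partition(`date`='20170905') values (3, 5);
select * from src_table where `date` = '20170905';
```
The issue itself asks for `insert overwrite` to handle this case without requiring `CASCADE` on the `alter table` statement.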



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
