hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhizhen Hou (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-16332) When create a partitioned text format table with one partition, after we change the format of table to orc, then the array type field may output error.
Date Tue, 29 Aug 2017 09:50:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-16332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Zhizhen Hou updated HIVE-16332:
-------------------------------
    Status: Open  (was: Patch Available)

> When create a partitioned text format table with one partition, after we change the format
of table to orc, then the array type field may output error.
> -------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-16332
>                 URL: https://issues.apache.org/jira/browse/HIVE-16332
>             Project: Hive
>          Issue Type: Bug
>          Components: ORC
>    Affects Versions: 2.1.1
>            Reporter: Zhizhen Hou
>            Assignee: Zhizhen Hou
>            Priority: Critical
>              Labels: patch
>
> ##The step to reproduce the result.
> 1. First crate a text format table with array type field in hive.
> ```
>  create table test_text_orc (
>   col_int bigint,
>   col_text string, 
>   col_array array<string>, 
>   col_map map<string, string>
>   ) 
>   PARTITIONED BY (
>    day string
>    )
>    ROW FORMAT DELIMITED
>  FIELDS TERMINATED BY ',' 
>  collection items TERMINATED  BY ']'
>  map keys TERMINATED BY ':'
>   ;
>  
> ```
> 2. Create new text file hive-orc-text-file-array-error-test.txt.
> ```
> 1,text_value1,array_value1]array_value2]array_value3, map_key1:map_value1,map_key2:map_value2
> 2,text_value2,array_value4, map_key1:map_value3
> ,text_value3,, map_key1:]map_key3:map_value3
> ```
> 3.  Load the data into one partition.
> ```
>  LOAD DATA local INPATH '.hive-orc-text-file-array-error-test.txt' overwrite into table
test_text_orc partition(day=20170329)
> ```
> 4. select the data to verify the result.
> ```
> hive> select * from test.test_text_orc;
> OK
> 1	text_value1	["array_value1","array_value2","array_value3"]	{" map_key1":"map_value1","map_key2":"map_value2"}
20170329
> 2	text_value2	["array_value4"]	{"map_key1":"map_value3"}	20170329
> NULL	text_value3	[]	{" map_key1":"","map_key3":"map_value3"}	20170329
> ```
> 5. Alter table format of table to orc;
> ```
>  alter table test_text_orc set fileformat orc;
> ```
> 6. Check the result again, and you can see the  error result.
> ```
> hive> select * from test.test_text_orc;
> OK
> 1	text_value1	["array_value1","array_value2","array_value3"]	{" map_key1":"map_value1","map_key2":"map_value2"}
20170329
> 2	text_value2	["array_value4","array_value2","array_value3"]	{"map_key1":"map_value3"}
20170329
> NULL	text_value3	["array_value4","array_value2","array_value3"]	{"map_key3":"map_value3","
map_key1":""}	20170329
> ```



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message