hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Neha Tomar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2443) Wrong delimiter is getting picked up for structs inside an array.
Date Tue, 17 Jul 2012 19:09:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416461#comment-13416461
] 

Neha Tomar commented on HIVE-2443:
----------------------------------

Can you please let me know if user defined delimiters are supported with hive 0.9.0?
                
> Wrong delimiter is getting picked up for structs inside an array.
> -----------------------------------------------------------------
>
>                 Key: HIVE-2443
>                 URL: https://issues.apache.org/jira/browse/HIVE-2443
>             Project: Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>            Reporter: Thulasi Ram Naidu P
>            Priority: Minor
>
> I am trying to create table with multiple level of delimiters. But the default LazySimpleSerDe
doesn't pick up the second serializer for serializing a struct inside an array which I specified
using COLLECTION ITEMS DELIMITED BY.
> My table looks like this:
> create external table if not exists mytable(col1 bigint, col2 string,
> col3 string, col4 double, col5 double, col6 double, col7 double, col8
> array<struct<id1:string, id2:string, id3:string, id4:string,
> id5:int>>)
>        ROW FORMAT DELIMITED
>        FIELDS TERMINATED BY '\t'
>        COLLECTION ITEMS TERMINATED BY ',:'
> Location '<FILEPATH>';
> Input data:
> 123456  XYZ1    RANDOM  1       1       1       1       x1:y1:z1:w1:5,x2:y2:z2:w1:5
> When I do "Select * from mytable" I am expecting output to be 
> 123456  XYZ1    RANDOM  1.0     1.0     1.0     1.0     [{"id1":"x1","id2":"y1","id3":"z1","id4":"w1","id5":5},{"id1":"x2","id2":"y2","id3":"z2","id4":"w1","id5":5}]
> However, it is returning,
> 123456  XYZ1    RANDOM  1.0     1.0     1.0     1.0
> [{"id1":"x1:y1:z1:w1:5","id2":null,"id3":null,"id4":null,"id5":null},{"id1":"x2:y2:z2:w1:5","id2":null,"id3":null,"id4":null,"id5":null}]
> But when I changed the schema of table as
> create external table if not exists mytable(col1 bigint, col2 string,
> col3 string, col4 double, col5 double, col6 double, col7 double, col8
> array<struct<id1:string, id2:string, id3:string, id4:string,
> id5:int>>)
>        ROW FORMAT DELIMITED
>        FIELDS TERMINATED BY '\t'
>        COLLECTION ITEMS TERMINATED BY ','
> MAP KEYS TERMINATED BY ':'
> Now the select query is returning the values correctly.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Mime
View raw message