hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lenni Kuff (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-12362) Hive's Parquet SerDe ignores 'serialization.null.format' property
Date Wed, 31 Aug 2016 18:39:20 GMT

    [ https://issues.apache.org/jira/browse/HIVE-12362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15453014#comment-15453014
] 

Lenni Kuff commented on HIVE-12362:
-----------------------------------

I don't have a test case available to confirm this, it was only done by looking at the code
so have not confirmed. Seems that there is extra working happening for each column value in
each row, so could have a possible performance impact. 

> Hive's Parquet SerDe ignores 'serialization.null.format' property
> -----------------------------------------------------------------
>
>                 Key: HIVE-12362
>                 URL: https://issues.apache.org/jira/browse/HIVE-12362
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 1.1.0
>            Reporter: Naveen Gangam
>            Assignee: Naveen Gangam
>         Attachments: HIVE-12362.2.patch, HIVE-12362.patch
>
>
> {code}
> create table src (a string);
> insert into table src values (NULL), (''), ('');
> 0: jdbc:hive2://localhost:10000/default> select * from src;
> +-----------+--+
> | src.a  |
> +-----------+--+
> | NULL      |
> |                |
> |                |
> +-----------+--+
> create table dest (a string) row format serde 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
stored as INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat' OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat';
> alter table dest set SERDEPROPERTIES ('serialization.null.format' = '');
> alter table dest set TBLPROPERTIES ('serialization.null.format' = '');
> insert overwrite table dest select * from src;
> 0: jdbc:hive2://localhost:10000/default> select * from test11;
> +-----------+--+
> | test11.a  |
> +-----------+--+
> | NULL      |
> |                |
> |                |
> +-----------+--+
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message