hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chengxiang Li (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-7142) Hive multi serialization encoding support
Date Wed, 06 Aug 2014 08:31:12 GMT

     [ https://issues.apache.org/jira/browse/HIVE-7142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chengxiang Li updated HIVE-7142:
--------------------------------

    Description: 
Hive only support serialize/deserialize in UTF-8, real world users want to load different
kinds of encoded data into hive directly. For many PRC customers, they would like to load
GBK encoded data.
We support config serialization encoding on table level by set serialization encoding through
serde parameter, for example:
{noformat}
alter table test set serdeproperties ('serialization.encoding'='GBK'); 
{noformat}

LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in this patch.

  was:
Hive only support serialize/deserialize in UTF-8, real world users want to load different
kinds of encoded data into hive directly. For many PRC customers, they would like to load
GBK encoded data.
We support config serialization encoding on table level by set serialization encoding through
serde parameter, for example:
{noformat}
alter table test set serdeproperties ('serialization.encoding'='GBK'); 
{noformat}


> Hive multi serialization encoding support
> -----------------------------------------
>
>                 Key: HIVE-7142
>                 URL: https://issues.apache.org/jira/browse/HIVE-7142
>             Project: Hive
>          Issue Type: Improvement
>          Components: Serializers/Deserializers
>            Reporter: Chengxiang Li
>            Assignee: Chengxiang Li
>         Attachments: HIVE-7142.1.patch.txt
>
>
> Hive only support serialize/deserialize in UTF-8, real world users want to load different
kinds of encoded data into hive directly. For many PRC customers, they would like to load
GBK encoded data.
> We support config serialization encoding on table level by set serialization encoding
through serde parameter, for example:
> {noformat}
> alter table test set serdeproperties ('serialization.encoding'='GBK'); 
> {noformat}
> LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in this patch.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message