hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "chengxiang li" <>
Subject Re: Review Request 24377: HIVE-7142 Hive multi serialization encoding support
Date Wed, 13 Aug 2014 08:13:26 GMT

This is an automatically generated e-mail. To reply, visit:

(Updated Aug. 13, 2014, 8:13 a.m.)

Review request for hive.

Bugs: HIVE-7142

Repository: hive-git


Currently Hive only support serialize data into UTF-8 charset bytes or deserialize from UTF-8
bytes, real world users may want to load different kinds of encoded data into hive directly.
This jira is dedicated to support serialize/deserialize all kinds of encoded data in SerDe
For user, only need to configure serialization encoding on table level by set serialization
encoding through serde parameter, for example:
CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
WITH SERDEPROPERTIES("serialization.encoding"='GBK');
ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK'); 
LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in this patch.

Diffs (updated)

  serde/if/serde.thrift 31c87ee 
  serde/src/gen/thrift/gen-cpp/serde_constants.h d56c917 
  serde/src/gen/thrift/gen-cpp/serde_constants.cpp 54503e3 
  serde/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/serde/ 515cf25

  serde/src/gen/thrift/gen-php/org/apache/hadoop/hive/serde/Types.php 837dd11 
  serde/src/gen/thrift/gen-py/org_apache_hadoop_hive_serde/ 8eac87d 
  serde/src/gen/thrift/gen-rb/serde_constants.rb ed86522 
  serde/src/java/org/apache/hadoop/hive/serde2/ PRE-CREATION

  serde/src/java/org/apache/hadoop/hive/serde2/ 179f9b5 
  serde/src/java/org/apache/hadoop/hive/serde2/ b7fb048 
  serde/src/java/org/apache/hadoop/hive/serde2/lazy/ fb55c70 




chengxiang li

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message