hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "chengxiang li" <chengxiang...@intel.com>
Subject Re: Review Request 24377: HIVE-7142 Hive multi serialization encoding support
Date Wed, 13 Aug 2014 08:12:58 GMT


> On Aug. 12, 2014, 11:55 p.m., Brock Noland wrote:
> > serde/if/serde.thrift, line 33
> > <https://reviews.apache.org/r/24377/diff/5/?file=658276#file658276line33>
> >
> >     Thank you! 
> >     
> >     You also need to run the build with the thriftif profile active to generate
the new new code:
> >     
> >     mvn clean install -Phadoop-2,thriftif -DskipTests

Thanks Brock, I've generated different language file with the maven command you mentioned
above, please help to check if I missed something else.


- chengxiang


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/24377/#review50388
-----------------------------------------------------------


On Aug. 12, 2014, 2:18 a.m., chengxiang li wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/24377/
> -----------------------------------------------------------
> 
> (Updated Aug. 12, 2014, 2:18 a.m.)
> 
> 
> Review request for hive.
> 
> 
> Bugs: HIVE-7142
>     https://issues.apache.org/jira/browse/HIVE-7142
> 
> 
> Repository: hive-git
> 
> 
> Description
> -------
> 
> Currently Hive only support serialize data into UTF-8 charset bytes or deserialize from
UTF-8 bytes, real world users may want to load different kinds of encoded data into hive directly.
This jira is dedicated to support serialize/deserialize all kinds of encoded data in SerDe
layer.
> For user, only need to configure serialization encoding on table level by set serialization
encoding through serde parameter, for example:
> CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
WITH SERDEPROPERTIES("serialization.encoding"='GBK');
> or
> ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK'); 
> LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in this patch.
> 
> 
> Diffs
> -----
> 
>   serde/if/serde.thrift 31c87ee 
>   serde/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/serde/serdeConstants.java
515cf25 
>   serde/src/java/org/apache/hadoop/hive/serde2/AbstractEncodingAwareSerDe.java PRE-CREATION

>   serde/src/java/org/apache/hadoop/hive/serde2/DelimitedJSONSerDe.java 179f9b5 
>   serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java b7fb048 
>   serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazySimpleSerDe.java fb55c70 
> 
> Diff: https://reviews.apache.org/r/24377/diff/
> 
> 
> Testing
> -------
> 
> 
> Thanks,
> 
> chengxiang li
> 
>


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message