hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "yangfang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-12653) The property "serialization.encoding" in the class "org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe" does not work
Date Mon, 14 Dec 2015 08:58:46 GMT

    [ https://issues.apache.org/jira/browse/HIVE-12653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15055657#comment-15055657
] 

yangfang commented on HIVE-12653:
---------------------------------

OK,Thank you for guidance, I have already modified the code and tested it.

> The property  "serialization.encoding" in the class "org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe"
does not work
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-12653
>                 URL: https://issues.apache.org/jira/browse/HIVE-12653
>             Project: Hive
>          Issue Type: Improvement
>          Components: Contrib
>    Affects Versions: 1.2.1
>            Reporter: yangfang
>            Assignee: yangfang
>         Attachments: HIVE-12653.2.patch, HIVE-12653.patch, HIVE-12653.patch
>
>
> when I create table with ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe'
and load some files with chinese encoded by GBK:
> create table PersonInfo (cod_fn_ent string, num_seq_trc_form string, date_tr string,

> num_jrn_no string, cod_trc_form_typ string,id_intl_ip string, name string  ) 
> ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe' 
> WITH SERDEPROPERTIES ("field.delim"="|!","serialization.encoding"='GBK');
> load data local inpath '/home/mr/hive/99-BoEing-IF_PMT_NOTE-2G-20151019-00000' overwrite
into table PersonInfo;
>  I found chinese disorder code in the table and  'serialization.encoding' does not work,
the chinese disorder data list as below:
> | ����                                                                      
                                                                         99999999�ϴ�����������
                                                                                         
       0624624002��ʱ����������                              
                                                                                         
           



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message