hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zhang Kai <zhangk1...@gmail.com>
Subject Re: Make hive support various charsets
Date Mon, 09 Apr 2012 00:17:23 GMT
Hi

I have created an issue HIVE-2917 and submitted patch through Phabricator.

Is there anyone who would like to review it?

Thanks,
Kai Zhang

2012/3/29 Namit Jain <njain@fb.com>

> Kai,
>
> That would be great.
>
> Please file a jura, and submit a patch.
> We would definitely like to get it for the whole community
>
>
> Thanks,
> -namit
>
>
> On 3/28/12 8:46 PM, "Zhang Kai" <zhangk1985@gmail.com> wrote:
>
> >Hi all
> >
> >I've been working with hive for some time.
> >
> >In my company, we use hive for querying on large datasets and found it's
> >very easy to use.
> >
> >However we also found hive is lack of various charsets support so that we
> >have to manually transform data files to utf-8 encoding before loading
> >them
> >into hive.
> >
> >So I have made a patch to make hive supports setting charset when creating
> >a table.
> >And the charset property will be used by SerDe when it serialize or
> >deserialize data.
> >
> >The modified hql is like:
> >
> >CREATE TABLE tbl1 (col1 STRING) ROW FORMAT CHARET "GBK" DELIMITED FIELDS
> >TERMINATED BY '\t';
> >
> >I'm very happy to contribute this to the community and looking forward to
> >your feedbacks.
> >
> >Thanks,
> >Kai Zhang
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message