hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vijay <tec...@gmail.com>
Subject Re: Regex and serde with hive
Date Fri, 23 Dec 2011 07:30:00 GMT
If the format is simply delimited like this, you don't need to use the
RegexSerde. Hive's default format with the right "FIELDS TERMINATED
BY" setting will work great.

-Vijay

On Thu, Dec 22, 2011 at 8:49 PM, Raghunath, Ranjith
<Ranjith.Raghunath1@usaa.com> wrote:
> I have been struggling with this for a while so I would appreciate any
> advice that you any of you may have.
>
>
>
> I have a file of the format
>
>
>
> “Xyz”~”qsd”~”1234”
>
>
>
> I created the following table definition to get the data loaded
>
>
>
> CREATE TABLE dummy
>
> (f1   string,
>
>   f2    string,
>
>   f3     string)
>
> ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
>
> WITH SERDEPROPERTIES  (
>
> "input.regex" = "([^\"~]*) ([^\"~]*) ([^\"~]*)?",
>
> "output.format.string" = "%1$s %2$s %3$s");
>
>
>
> When I load the data in and try to perform a select get NULL values. Thanks
> again.
>
> Thank you,
>
> Ranjith
>
>

Mime
View raw message