hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Jiang <it.mjji...@gmail.com>
Subject Re: how to convert single line into multiple lines in a serde (txt in txt out)?
Date Wed, 30 Mar 2011 19:46:02 GMT
Also what if I want just one step to load each log entry line from log file
and for each generate multiple lines? That is, just one table created. I
don't want to have one table and then call explode() to get multiple lines.
Otherwise, alternative way is to use streaming on loaded table to turn it
into another one with no need to customize a serde. So, yeah, the goal here
is to see how a serde can do this stuff.

Thanks!

On Wed, Mar 30, 2011 at 12:03 PM, Edward Capriolo <edlinuxguru@gmail.com>wrote:

> On Wed, Mar 30, 2011 at 2:55 PM, Michael Jiang <it.mjjiang@gmail.com>
> wrote:
> > Want to extend RegexSerDe to parse apache web log: for each log entry,
> need
> > to convert it into multiple entries. This is easy in streaming. But new
> to
> > serde, wondering if it is doable and how? Thanks!
> >
>
> You can have your serde produce list<struct> and then explode() them.
>

Mime
View raw message