pig-dev mailing list archives

From Dmitriy Ryaboy <dvrya...@gmail.com>
Subject Re: What should FLATTEN do?
Date Fri, 02 Apr 2010 18:42:59 GMT
CDH2 or CDH3?

CDH2 is basically Pig 0.{4,5}. CDH3 is somewhere between 0.5 and 0.6.

I expect the first result -- a flattened bag of tuples results in multiple
rows, each containing the (not-flattened) tuple.
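
To make the one-level rule concrete, here is a toy model in Python. It is a sketch of the semantics as I understand them, not Pig's implementation: a bag is modeled as a list of tuples, and flatten_bag and flatten_tuple are made-up helper names.

```python
# Toy model of FLATTEN (illustration only, not Pig code).
# Assumption: a bag is a list of tuples, a row is a tuple of fields.

def flatten_bag(row, bag_index):
    """Yield one output row per element of the bag at bag_index.

    Only one level of nesting is removed: the fields of each bag element
    are spliced into the row, and a tuple nested inside stays a tuple.
    """
    prefix = row[:bag_index]
    suffix = row[bag_index + 1:]
    for element in row[bag_index]:
        yield prefix + tuple(element) + suffix


def flatten_tuple(row, index):
    """Splice the fields of the tuple at index into the row."""
    return row[:index] + tuple(row[index]) + row[index + 1:]


# The row from the question: each bag element is a tuple wrapping a tuple.
row = ("id", "data", [((1, 2),), ((2, 3),), ((4, 5),)])

once = list(flatten_bag(row, 2))
twice = [flatten_tuple(r, 2) for r in once]

for r in once:
    print(r)
```

In this model, once holds the first result from the question (the inner tuple survives), and twice, which flattens the inner tuple as a separate second step, holds the second result.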

Btw, Pig 0.6 is out.

-D

On Fri, Apr 2, 2010 at 11:32 AM, hc busy <hc.busy@gmail.com> wrote:

> doh!!!! s/map/bag/g
>
> I seem to get maps and bags mixed up for some reason...
>
> Guys, I have a row containing a *bag*
>
> 'id','data', {((1,2)), ((2,3)), ((4,5))}
>
> What is the expected behavior when I flatten on that bag? I had expected it
> to result in
>
> 'id','data', (1,2)
> 'id','data', (2,3)
> 'id','data', (4,5)
>
>
> But it appears to me that the result of applying FLATTEN to that bag is
> this instead:
>
> 'id','data', 1,2
> 'id','data', 2,3
> 'id','data', 4,5
>
>
> The latter is returned by Cloudera's current CDH2, and I've seen the
> former behavior on other versions of Pig.
>
> Which is the correct behavior by design?
>
> What will pig 0.6 do when it is released?
>
> thanks!
> On Fri, Apr 2, 2010 at 11:29 AM, hc busy <hc.busy@gmail.com> wrote:
>
> > Guys, I have a row containing a map
> >
> > 'id','data', {((1,2)), ((2,3)), ((4,5))}
> >
> > What is the expected behavior when I flatten on that bag? I had expected it
> > to result in
> >
> > 'id','data', (1,2)
> > 'id','data', (2,3)
> > 'id','data', (4,5)
> >
> >
> > But it appears to me that the result of applying FLATTEN to that bag is
> > this instead:
> >
> > 'id','data', 1,2
> > 'id','data', 2,3
> > 'id','data', 4,5
> >
> >
> > The latter is returned by Cloudera's current CDH2, and I've seen the
> > former behavior on other versions of Pig.
> >
> > Which is the correct behavior by design?
> >
> > What will pig 0.6 do when it is released?
> >
> > thanks!
> >
>
