hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Viswanathan J <jayamviswanat...@gmail.com>
Subject Fwd: Pig GROUP operator - Data is shuffled and wind up together for the same grouping key
Date Thu, 29 Aug 2013 10:14:37 GMT
Appreciate the response.  I'm facing this issue in prod.

---------- Forwarded message ----------
From: Viswanathan J <jayamviswanathan@gmail.com>
Date: Thu, Aug 29, 2013 at 2:00 PM
Subject: Pig GROUP operator - Data is shuffled and wind up together for the
same grouping key
To: "user@pig.apache.org" <user@pig.apache.org>


Hi,

I'm using pig version 0.11.0

While using GROUP operator in Pig all the data is shuffled, so that rows in
different partitions that have the same grouping key wind up together and
got wrong results for grouping.

While storing the result data, it is share work between multiple
calculations.

How to solve this? Please advice.

-- 
Regards,
Viswa.J



-- 
Regards,
Viswa.J

Mime
View raw message