hive-user mailing list archives

From tim robertson <>
Subject many terms in group by
Date Fri, 03 Jul 2009 21:11:56 GMT
Hi all,

I have several MapReduce jobs that basically do counts with a
group by on tab-delimited files.
I am getting tired of writing the same thing over again for each
report, so I am thinking of trying Hive for this.

Does Hive work OK with 9 or so terms in the group by?
(E.g. is it happy concatenating the fields to make the key emitted
from the map, so that it can do the count in a reduce and complete
in one MapReduce job?)

I mean the equivalent of:
  select a,b,c,d,e,f,g,h,i,count(*) from x group by a,b,c,d,e,f,g,h,i;
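
To make the composite-key idea concrete, here is a minimal sketch in
plain Python (not Hadoop or Hive code) of the map and reduce steps
described above. The function names, the n_keys parameter, and the
assumption that the group-by columns are the first fields of each
tab-delimited line are all hypothetical, chosen only for illustration.

```python
from collections import Counter

def map_row(line, n_keys=9):
    """Map step: emit (composite_key, 1) for one tab-delimited line.

    Assumes (for illustration only) that the group-by columns are the
    first n_keys fields of the line.
    """
    fields = line.rstrip("\n").split("\t")
    # Concatenate the group-by fields into a single key.
    key = "\t".join(fields[:n_keys])
    return key, 1

def count_groups(lines, n_keys=9):
    """Reduce step: sum the emitted 1s for each composite key."""
    counts = Counter()
    for line in lines:
        key, one = map_row(line, n_keys)
        counts[key] += one
    return counts
```

Calling count_groups(open(path)) on one of the input files would give
one count per distinct combination of the first nine fields, which is
what the group by above asks for.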

Many thanks,
