hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jie Li <ji...@cs.duke.edu>
Subject Map-only aggregation
Date Sat, 05 Jan 2013 03:46:42 GMT
Hi all,

Can Hive implement the aggregation as a Map-only job? As we know the
data may be pre-partitioned via PARTITION-BY or CLUSTERED-BY, so we
don't need the reduce phase to repartition the data.

The Bucket Join seems to take advantage of the buckets for joins, so I
wonder if there is some similar optimization for aggregations.

Thanks,
Jie

Mime
View raw message