hive-user mailing list archives

From Robert Grandl
Subject Hive statistics
Date Wed, 21 Dec 2016 05:01:02 GMT
Hi guys,
I am wondering whether it is possible to estimate the number of distinct keys and their distribution
in one way or another.

More concretely, for every stage, is it possible to determine the number of distinct keys
and, for each key, the number of values before the data is actually processed?
