hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alex Nastetsky <>
Subject custom table/column statistics
Date Mon, 09 Jun 2014 01:33:06 GMT
Table statistics collection was added in HIVE-33 (numRows, rawDataSize,
etc). Is there anything that lets you create your own statistics gathering?

For example, given table A with columns x, y, z, I want to gather
count(distinct x, y) as a statistic that would be stored in the metastore.

I know there exist column level statistics that have approximations for
distinct for a single column, but that doesn't help my use case above where
I have 2 columns involved.


View raw message