hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Häusler <mich...@akatose.de>
Subject column statistics for non-primitive types
Date Mon, 13 Jun 2016 15:00:49 GMT
Hi there,


when testing column statistics I stumbled upon the following error message:

DROP TABLE IF EXISTS foo;
CREATE TABLE foo (foo BIGINT, bar ARRAY<BIGINT>, foobar STRUCT<key:STRING,value:STRING>);

ANALYZE TABLE foo COMPUTE STATISTICS FOR COLUMNS;
FAILED: UDFArgumentTypeException Only primitive type arguments are accepted but array<bigint>
is passed.

ANALYZE TABLE foo COMPUTE STATISTICS FOR COLUMNS foobar, bar;
FAILED: UDFArgumentTypeException Only primitive type arguments are accepted but struct<key:string,value:string>
is passed.


1) Basically, it seems that column statistics don't work for non-primitive types. Are there
any workarounds or any plans to change this?

2) Furthermore, the convenience syntax to compute statistics for all columns does not work
as soon as there is a non-supported column. Are there any plans to change this, so it is easier
to compute statistics for all supported columns?

3) ANALYZE TABLE will only provide the first failing *type* in the error message. Especially
for wide tables it would be much easier if all non-supported column *names* would be printed.


Best regards
Michael


Mime
View raw message