hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mich Talebzadeh <mich.talebza...@gmail.com>
Subject Re: column statistics for non-primitive types
Date Mon, 13 Jun 2016 20:21:07 GMT
which version of Hive are you using?

Dr Mich Talebzadeh



LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*



http://talebzadehmich.wordpress.com



On 13 June 2016 at 16:00, Michael Häusler <michael@akatose.de> wrote:

> Hi there,
>
>
> when testing column statistics I stumbled upon the following error message:
>
> DROP TABLE IF EXISTS foo;
> CREATE TABLE foo (foo BIGINT, bar ARRAY<BIGINT>, foobar
> STRUCT<key:STRING,value:STRING>);
>
> ANALYZE TABLE foo COMPUTE STATISTICS FOR COLUMNS;
> FAILED: UDFArgumentTypeException Only primitive type arguments are
> accepted but array<bigint> is passed.
>
> ANALYZE TABLE foo COMPUTE STATISTICS FOR COLUMNS foobar, bar;
> FAILED: UDFArgumentTypeException Only primitive type arguments are
> accepted but struct<key:string,value:string> is passed.
>
>
> 1) Basically, it seems that column statistics don't work for non-primitive
> types. Are there any workarounds or any plans to change this?
>
> 2) Furthermore, the convenience syntax to compute statistics for all
> columns does not work as soon as there is a non-supported column. Are there
> any plans to change this, so it is easier to compute statistics for all
> supported columns?
>
> 3) ANALYZE TABLE will only provide the first failing *type* in the error
> message. Especially for wide tables it would be much easier if all
> non-supported column *names* would be printed.
>
>
> Best regards
> Michael
>
>

Mime
View raw message