hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jesus Camacho Rodriguez (JIRA)" <>
Subject [jira] [Commented] (HIVE-9512) HIVE-9327 causing regression in stats annotation
Date Mon, 02 Feb 2015 19:15:34 GMT


Jesus Camacho Rodriguez commented on HIVE-9512:

Thanks! I agree with you, it could be generalized; this is a fix for that specific case.

Just for giving a bit more info on what was going on with UDFs with the former patch in case
it helps.
E.g. consider udf_inline.q. You get a plan TS[0]-UDFT[1]-Limit[2]-FS[3].
The stats were lost after UDFT was applied because they are stored for input column col_0,
and they propagate to the Limit operator still referring to col_0.
However, the UDF generates tuples with schema (col_1, col_2).
As the logic for calculating the stats for the limit operator was looking for stats for columns
col_1, col_2, and there is a mismatch schema-stats, it was removing the stats.

> HIVE-9327 causing regression in stats annotation
> ------------------------------------------------
>                 Key: HIVE-9512
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Jesus Camacho Rodriguez
>             Fix For: 1.2.0
>         Attachments: HIVE-9512.01.patch, HIVE-9512.02.patch, HIVE-9512.patch
> HIVE-9327 causes regression to statistics annotation test case. Regression can be seen
> The expected data size is 194 but 0 is returned.

This message was sent by Atlassian JIRA

View raw message