hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-13260) ReduceSinkDeDuplication throws exception when pRS key is empty
Date Thu, 10 Mar 2016 22:56:40 GMT

    [ https://issues.apache.org/jira/browse/HIVE-13260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15190128#comment-15190128
] 

Prasanth Jayachandran commented on HIVE-13260:
----------------------------------------------

Can you add explain to the test case with RS dedup enable and disabled?

Also in the patch, you are copying the number of distribution keys from cRS but not the fields.
I suspect that. We might need a test run to see if that breaks anything.

> ReduceSinkDeDuplication throws exception when pRS key is empty
> --------------------------------------------------------------
>
>                 Key: HIVE-13260
>                 URL: https://issues.apache.org/jira/browse/HIVE-13260
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>         Attachments: HIVE-13260.01.patch
>
>
> Steps to reproduce:
> {code}
> set hive.mapred.mode=nonstrict;
> set hive.cbo.enable=false;
> set hive.map.aggr=false;
> set hive.groupby.skewindata=false;
> set mapred.reduce.tasks=31;
> select compute_stats(a,16),compute_stats(b,16),compute_stats(c,16),compute_stats(d,16)
> from
> (
> select
>   avg(DISTINCT substr(src.value,5)) as a,
>   max(substr(src.value,5)) as b,
>   variance(substr(src.value,5)) as c,
>   var_samp(substr(src.value,5)) as d
>  from src)subq;
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message