hive-issues mailing list archives

From "Jesus Camacho Rodriguez (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-13287) Add logic to estimate stats for IN operator
Date Thu, 07 Apr 2016 19:54:25 GMT

    [ https://issues.apache.org/jira/browse/HIVE-13287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15230932#comment-15230932 ]

Jesus Camacho Rodriguez commented on HIVE-13287:
------------------------------------------------

I have uploaded a new patch. In short, the original patch had a problem: in some cases
it took the original number of columns as zero (from evaluatedRowCount). The new patch
fixes that issue.

> Add logic to estimate stats for IN operator
> -------------------------------------------
>
>                 Key: HIVE-13287
>                 URL: https://issues.apache.org/jira/browse/HIVE-13287
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 2.1.0
>            Reporter: Jesus Camacho Rodriguez
>            Assignee: Jesus Camacho Rodriguez
>         Attachments: HIVE-13287.01.patch, HIVE-13287.02.patch, HIVE-13287.patch
>
>
> Currently, the IN operator is handled by the default case, which reduces the input rows
> by half. This may lead to wrong estimates for the number of rows produced by Filter operators.
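
To illustrate the difference between the default "half the rows" estimate and a
statistics-based one, here is a minimal sketch. It is not the code in the attached
patches. It assumes a common textbook heuristic (selectivity = distinct values in the
IN list divided by the column's number of distinct values), and the class name,
method name, and parameters are hypothetical, not Hive APIs.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class InSelectivitySketch {

    /**
     * Hypothetical helper (not a Hive API): estimates the rows surviving
     * a "col IN (v1, v2, ...)" filter.
     *
     * @param inputRows estimated rows entering the filter
     * @param columnNdv number of distinct values of the column, or <= 0 if unknown
     * @param inValues  literal values listed in the IN clause
     */
    static long estimateInRows(long inputRows, long columnNdv, List<?> inValues) {
        if (inputRows <= 0) {
            return 0;
        }
        if (columnNdv <= 0) {
            // No usable column statistics: fall back to the default "half" rule.
            return Math.max(1, inputRows / 2);
        }
        // Duplicate literals in the IN list do not select additional rows.
        Set<Object> distinct = new HashSet<>(inValues);
        // Assumes values are uniformly distributed; cap the selectivity at 1.
        double selectivity = Math.min(1.0, (double) distinct.size() / columnNdv);
        return Math.max(1, Math.round(inputRows * selectivity));
    }

    public static void main(String[] args) {
        // 1000 input rows, 100 distinct values, IN list with 3 distinct literals.
        System.out.println(estimateInRows(1000, 100, Arrays.asList(1, 2, 3)));
        // Unknown NDV: the estimate falls back to half of the input rows.
        System.out.println(estimateInRows(1000, 0, Arrays.asList(1, 2, 3)));
    }
}
```

With a short IN list on a column with many distinct values, the statistics-based
estimate is far below half of the input, which is the kind of gap the description
above refers to.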



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
