hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vineet Garg (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-17066) Query78 filter wrong estimatation is generating bad plan
Date Tue, 11 Jul 2017 00:49:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-17066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Vineet Garg updated HIVE-17066:
-------------------------------
    Status: Open  (was: Patch Available)

> Query78 filter wrong estimatation is generating bad plan
> --------------------------------------------------------
>
>                 Key: HIVE-17066
>                 URL: https://issues.apache.org/jira/browse/HIVE-17066
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>         Attachments: HIVE-17066.1.patch, HIVE-17066.2.patch
>
>
> Filter operator is estimating 1 row following a left outer join causing bad estimates
> {noformat}
> Reducer 12 
>             Execution mode: vectorized, llap
>             Reduce Operator Tree:
>               Map Join Operator
>                 condition map:
>                      Left Outer Join0 to 1
>                 keys:
>                   0 KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: bigint)
>                   1 KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: bigint)
>                 outputColumnNames: _col0, _col1, _col3, _col4, _col5, _col6, _col8
>                 input vertices:
>                   1 Map 14
>                 Statistics: Num rows: 71676270660 Data size: 3727166074320 Basic stats:
COMPLETE Column stats: COMPLETE
>                 Filter Operator
>                   predicate: _col8 is null (type: boolean)
>                   Statistics: Num rows: 1 Data size: 52 Basic stats: COMPLETE Column
stats: COMPLETE
>                   Select Operator
>                     expressions: _col0 (type: bigint), _col1 (type: bigint), _col3 (type:
int), _col4 (type: double), _col5 (type: double), _col6 (type: bigint)
>                     outputColumnNames: _col0, _col1, _col3, _col4, _col5, _col6
>                     Statistics: Num rows: 1 Data size: 52 Basic stats: COMPLETE Column
stats: COMPLETE
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message