hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Dere (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-10704) Errors in Tez HashTableLoader when estimated table size is 0
Date Fri, 15 May 2015 18:59:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-10704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jason Dere updated HIVE-10704:
------------------------------
    Attachment: HIVE-10704.2.patch

Patch v2 - removed redundant calls. Also changed the logic so that the memory for all hashtables
are computed at the same time. If we have to do the fallback logic for one table, we should
probably do it for all tables.

> Errors in Tez HashTableLoader when estimated table size is 0
> ------------------------------------------------------------
>
>                 Key: HIVE-10704
>                 URL: https://issues.apache.org/jira/browse/HIVE-10704
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Jason Dere
>            Assignee: Jason Dere
>         Attachments: HIVE-10704.1.patch, HIVE-10704.2.patch
>
>
> Couple of issues:
> - If the table sizes in MapJoinOperator.getParentDataSizes() are 0 for all tables, the
largest small table selection is wrong and could select the large table (which results in
NPE)
> - The memory estimates can either divide-by-zero, or allocate 0 memory if the table size
is 0. Try to come up with a sensible default for this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message