hadoop-hive-dev mailing list archives

From "Jeff Hammerbacher (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-969) choose the largest table for join as the table to be streamed
Date Thu, 03 Dec 2009 21:56:20 GMT

    [ https://issues.apache.org/jira/browse/HIVE-969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785554#action_12785554 ]

Jeff Hammerbacher commented on HIVE-969:

My question still stands. Are you planning on implementing a statistics collection process
to capture table size in the metastore for this issue?

> choose the largest table for join as the table to be streamed
> -------------------------------------------------------------
>                 Key: HIVE-969
>                 URL: https://issues.apache.org/jira/browse/HIVE-969
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Namit Jain
> In the absence of statistics, the join order is random. This creates a problem for users,
> since they mostly don't care about the size of the tables.
> So, instead of choosing the join order randomly, we can make the largest table the
> table to be streamed if the user did not specify any hints.
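
The selection rule described in the issue could be sketched roughly as follows. This is only an illustration, not Hive's actual code: the function names, the `table_sizes` mapping (assumed here to hold an on-disk size in bytes obtained from some source), and the `hinted` parameter are all hypothetical. It also assumes the streamed table is placed last in the join order, with the remaining tables buffered.

```python
from typing import Dict, List, Optional


def choose_streamed_table(table_sizes: Dict[str, int],
                          hinted: Optional[str] = None) -> str:
    """Pick the table to stream through the join.

    table_sizes: hypothetical mapping of table name -> size in bytes.
    hinted: table named in a user hint, if any (hypothetical parameter).
    """
    if not table_sizes:
        raise ValueError("a join needs at least one table")
    if hinted is not None:
        # A user hint always wins over the size-based choice.
        if hinted not in table_sizes:
            raise KeyError(f"hinted table {hinted!r} is not part of the join")
        return hinted
    # No hint: stream the largest table. On a tie, the first table
    # encountered in the mapping is kept.
    return max(table_sizes, key=lambda name: table_sizes[name])


def join_order(table_sizes: Dict[str, int],
               hinted: Optional[str] = None) -> List[str]:
    """Return the tables with the streamed one last (assumed convention)."""
    streamed = choose_streamed_table(table_sizes, hinted)
    buffered = [name for name in table_sizes if name != streamed]
    return buffered + [streamed]
```

For example, `join_order({"a": 10, "b": 500, "c": 20})` puts `b` last, while passing `hinted="a"` would stream `a` regardless of size. Where the sizes come from (collected statistics or something cheaper) is exactly the open question raised in the comment above.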

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
