hive-issues mailing list archives

From "Pengcheng Xiong (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-10900) Fix the indeterministic stats for some hive queries
Date Fri, 05 Jun 2015 22:32:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-10900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pengcheng Xiong updated HIVE-10900:
-----------------------------------
    Attachment: HIVE-10900.01.patch

Temporary fix for the Accumulo stats. [~ashutoshc], could you please take a look? Also cc'ing [~jpullokkaran].

> Fix the indeterministic stats for some hive queries 
> ----------------------------------------------------
>
>                 Key: HIVE-10900
>                 URL: https://issues.apache.org/jira/browse/HIVE-10900
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>            Priority: Minor
>         Attachments: HIVE-10900.01.patch
>
>
> If we do not run compute stats for a table and then do some operation on that table,
we get different stats numbers when we run explain. The main reason is that, when there are
no table stats, Hive stats depend on the OS/FS configuration, which differs between
environments. A simple fix is to add compute stats for the queries whose stats are indeterministic.
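
A minimal sketch of what "add compute stats" could look like in a query file, assuming a hypothetical table named src_tbl (the table name and the query are illustrative and not taken from the patch):

```sql
-- Hypothetical table; stands in for any table that has no stored stats.
-- Gather basic table stats so the planner does not fall back on
-- estimates derived from the file system.
ANALYZE TABLE src_tbl COMPUTE STATISTICS;

-- With stored stats in place, the numbers in the plan should no longer
-- depend on the OS/FS configuration.
EXPLAIN SELECT key, count(*) FROM src_tbl GROUP BY key;
```

Whether the attached patch takes exactly this form is not stated in this message.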



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
