hive-issues mailing list archives

From "Pengcheng Xiong (JIRA)" <>
Subject [jira] [Updated] (HIVE-10900) Fix the indeterministic stats for some hive queries
Date Fri, 05 Jun 2015 22:32:00 GMT


Pengcheng Xiong updated HIVE-10900:
    Attachment: HIVE-10900.01.patch

Temporary fix for Accumulo stats. [~ashutoshc], could you please take a look? Also cc'ing [~jpullokkaran].

> Fix the indeterministic stats for some hive queries 
> ----------------------------------------------------
>                 Key: HIVE-10900
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>            Priority: Minor
>         Attachments: HIVE-10900.01.patch
> If we do not compute stats for a table and then run some operation on that table,
> we get different stats numbers when we run EXPLAIN, depending on the environment.
> The main reason is that, when there are no table stats, Hive's stats depend on the
> OS/FS configuration. A simple fix is to add compute stats for the tables whose stats
> are nondeterministic.
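
The reasoning in the description can be illustrated with a toy model. This is only a sketch, not Hive's actual estimator: the function name, the row size, and the byte counts are all hypothetical, and it assumes that without stored stats the row count is guessed from the on-disk file size. In HiveQL, stored stats are typically produced with ANALYZE TABLE ... COMPUTE STATISTICS.

```python
from typing import Optional


def estimate_row_count(stored_row_count: Optional[int],
                       file_size_bytes: int,
                       assumed_row_size_bytes: int) -> int:
    """Toy estimator (hypothetical): prefer stored stats, else guess from file size."""
    if stored_row_count is not None:
        # Stats were computed explicitly, so the answer is environment-independent.
        return stored_row_count
    # No stats: fall back to a guess that depends on how the file system
    # reports the table's size.
    return file_size_bytes // assumed_row_size_bytes


# The same logical table, whose on-disk size differs between two machines
# (made-up numbers standing in for different OS/FS configurations).
size_on_machine_a = 5_812
size_on_machine_b = 5_840
ROW_SIZE = 8  # assumed average row size in bytes

without_stats = (
    estimate_row_count(None, size_on_machine_a, ROW_SIZE),
    estimate_row_count(None, size_on_machine_b, ROW_SIZE),
)
with_stats = (
    estimate_row_count(500, size_on_machine_a, ROW_SIZE),
    estimate_row_count(500, size_on_machine_b, ROW_SIZE),
)

print("without stats:", without_stats)  # the two estimates disagree
print("with stats:   ", with_stats)     # both machines report the stored count
```

Under these assumptions, the estimates only agree across machines once a stored row count is available, which is why computing stats up front would make the EXPLAIN output stable.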

This message was sent by Atlassian JIRA
