hive-dev mailing list archives

From "Ashutosh Chauhan (JIRA)" <>
Subject [jira] [Updated] (HIVE-3777) add a property in the partition to figure out if stats are accurate
Date Fri, 01 Nov 2013 18:05:19 GMT


Ashutosh Chauhan updated HIVE-3777:

    Attachment: HIVE-3777.2.patch

Review request up at:
.q.out needs to be updated, so some failures in Hive QA are expected.

> add a property in the partition to figure out if stats are accurate
> -------------------------------------------------------------------
>                 Key: HIVE-3777
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Ashutosh Chauhan
>         Attachments: HIVE-3777.2.patch, HIVE-3777.patch
> Currently, the stats task tries to update the statistics in the table/partition
> being updated after the table/partition is loaded. If updating these stats
> fails (for any reason), the operation either succeeds
> (writing inaccurate stats) or fails, depending on whether hive.stats.reliable
> is set to true. This can be bad for applications that do not always care about
> reliable stats, since the query may have taken a long time to execute, only to
> fail at the end.
> Another property should be added to the partition: areStatsAccurate. If hive.stats.reliable
> is set to false and stats could not be computed correctly, the operation would
> still succeed and update the stats, but set areStatsAccurate to false.
> If the application cares about accurate stats, they can be obtained in the
> background.
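
The behaviour proposed in the description can be sketched as a small Python model. This is not Hive code and not the attached patch: Partition, finish_load, StatsUpdateError and the numRows key in the usage note are hypothetical names used only for illustration. Setting areStatsAccurate to "true" when stats are computed correctly is an assumption; the description only spells out the "false" case.

```python
from dataclasses import dataclass, field


class StatsUpdateError(Exception):
    """Hypothetical error: the stats failure fails the whole operation."""


@dataclass
class Partition:
    # Hypothetical stand-in for a partition's property map.
    params: dict = field(default_factory=dict)


def finish_load(partition, collected_stats, stats_ok, stats_reliable):
    """Model the end of a table/partition load.

    stats_ok       -- True if the stats were computed correctly
    stats_reliable -- the value of hive.stats.reliable
    """
    if not stats_ok and stats_reliable:
        # hive.stats.reliable=true: a stats failure fails the operation.
        raise StatsUpdateError("stats could not be computed correctly")
    # Otherwise the operation succeeds and the stats are still written,
    # with the flag recording whether they can be trusted.
    partition.params.update(collected_stats)
    # Assumption: the flag is set to "true" on the success path.
    partition.params["areStatsAccurate"] = "true" if stats_ok else "false"
    return partition
```

With stats_reliable=False and stats_ok=False, finish_load(Partition(), {"numRows": "10"}, False, False) returns a partition whose areStatsAccurate property is "false", so an application that needs accurate stats could recompute them later in the background.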

This message was sent by Atlassian JIRA
