hive-dev mailing list archives

From "Namit Jain (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-3292) Table Statistics
Date Tue, 24 Jul 2012 09:50:33 GMT
Namit Jain created HIVE-3292:
--------------------------------

             Summary: Table Statistics 
                 Key: HIVE-3292
                 URL: https://issues.apache.org/jira/browse/HIVE-3292
             Project: Hive
          Issue Type: Bug
            Reporter: Namit Jain


There is a problem with table statistics:

If two partitions of a table are created or updated concurrently,
table-level stats may be incorrect.

For example, if a table T has two partitions P1 and P2 with 10 and 20 rows
respectively, and both partitions are being updated concurrently (via
insert overwrite ...), the table object is obtained in StatsTask and updated.
Even with concurrency turned on, the table is not locked, so one of the two
table statistics updates may be lost.
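
The lost update described above can be illustrated with a small standalone
sketch. This is not Hive's StatsTask code: the class TableStatsRace, the
TableStats holder, and all method names are hypothetical, and the assumption
that each writer adds its partition's row count to the table-level count is
made only for illustration. The first method replays one unlucky interleaving
step by step, and the second shows the same two updates made under a lock.

```java
public class TableStatsRace {
    /** Hypothetical stand-in for the table-level row-count statistic. */
    static final class TableStats {
        private long numRows;

        long read() { return numRows; }

        void write(long value) { numRows = value; }

        // Read-modify-write performed while holding the object's monitor.
        synchronized void addRows(long delta) { numRows += delta; }
    }

    /** Both writers read the table-level count before either writes back. */
    static long unlockedInterleaving(long p1Rows, long p2Rows) {
        TableStats table = new TableStats();
        long seenByP1 = table.read();      // writer for P1 reads 0
        long seenByP2 = table.read();      // writer for P2 also reads 0
        table.write(seenByP1 + p1Rows);    // table-level count becomes p1Rows
        table.write(seenByP2 + p2Rows);    // overwrites it; P1's update is lost
        return table.read();
    }

    /** The same two updates, each done atomically under a lock. */
    static long lockedUpdates(long p1Rows, long p2Rows) {
        TableStats table = new TableStats();
        Thread p1 = new Thread(() -> table.addRows(p1Rows));
        Thread p2 = new Thread(() -> table.addRows(p2Rows));
        p1.start();
        p2.start();
        try {
            p1.join();
            p2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return table.read();
    }

    public static void main(String[] args) {
        System.out.println("unlocked: " + unlockedInterleaving(10, 20)); // prints "unlocked: 20"
        System.out.println("locked:   " + lockedUpdates(10, 20));        // prints "locked:   30"
    }
}
```

With the row counts from the example (10 and 20), the unlocked interleaving
leaves the table-level count at 20 instead of 30. Whether the actual fix
should lock the table or recompute table-level stats from the partitions is
left open here.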

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
