hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gang Tim Liu (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-3917) Support fast operation for analyze command
Date Sat, 19 Jan 2013 02:12:13 GMT
Gang Tim Liu created HIVE-3917:
----------------------------------

             Summary: Support fast operation for analyze command
                 Key: HIVE-3917
                 URL: https://issues.apache.org/jira/browse/HIVE-3917
             Project: Hive
          Issue Type: Improvement
          Components: Statistics
    Affects Versions: 0.11.0
            Reporter: Gang Tim Liu
            Assignee: Gang Tim Liu


hive supports analyze command to gather statistics from existing tables/partition https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-ExistingTables

It collects:
1. Number of Rows
2. Number of files
3. Size in Bytes

If table/partition is big, the operation would take time since it will open all files and
scan all data.

It would be nice to support fast operation to gather statistics which doesn't require to open
all files like
1.. Number of files
2. Size in Bytes






--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message