hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashutosh Chauhan (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-12309) TableScan should colStats when available for better data size estimate
Date Fri, 30 Oct 2015 21:20:27 GMT
Ashutosh Chauhan created HIVE-12309:
---------------------------------------

             Summary: TableScan should colStats when available for better data size estimate
                 Key: HIVE-12309
                 URL: https://issues.apache.org/jira/browse/HIVE-12309
             Project: Hive
          Issue Type: Improvement
          Components: Statistics
            Reporter: Ashutosh Chauhan
            Assignee: Ashutosh Chauhan


Currently, all other operators use column stats to figure out data size, whereas TableScan
relies on rawDataSize. This inconsistency can result in an inconsistency where TS may have
lower Datasize then subsequent operators.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message