hive-dev mailing list archives

From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-9560) When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'
Date Wed, 04 Feb 2015 02:20:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-9560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304508#comment-14304508 ]

Prasanth Jayachandran commented on HIVE-9560:
---------------------------------------------

[~ashutoshc] Yeah, that makes sense. I will put up a patch to make that switch automatic in
the case of ORC.

> When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result
> in value '0' after running 'analyze table TABLE_NAME compute statistics;'
> --------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-9560
>                 URL: https://issues.apache.org/jira/browse/HIVE-9560
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Xin Hao
>
> When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result
> in value '0' after running 'analyze table TABLE_NAME compute statistics;'
> Steps to reproduce:
> (1) set hive.stats.collect.rawdatasize=true;
> (2) Generate an ORC table in Hive; the value of its 'rawDataSize' is NOT zero.
> You can find the value of 'rawDataSize' (NOT zero) by executing 'describe extended TABLE_NAME;'
> (4) Execute 'analyze table TABLE_NAME compute statistics;'
> (5) Execute 'describe extended TABLE_NAME;' again, and you will find that the value
> of 'rawDataSize' has changed to '0'.
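
The reproduction steps above can be sketched as a single HiveQL session. This is
only an illustration under stated assumptions: the table names orc_stats_demo and
src_table are hypothetical (the report only says TABLE_NAME), src_table is assumed
to already exist with some rows, and automatic stats gathering on insert is assumed
to be enabled so that 'rawDataSize' is populated when the table is created.

```sql
-- (1) Enable raw data size collection, as in the report.
SET hive.stats.collect.rawdatasize=true;

-- (2) Create and populate an ORC table.
--     orc_stats_demo and src_table are hypothetical names, not from the report.
CREATE TABLE orc_stats_demo STORED AS ORC AS
SELECT * FROM src_table;

-- Inspect the table parameters; per the report, rawDataSize is non-zero here.
DESCRIBE EXTENDED orc_stats_demo;

-- (4) Recompute table statistics.
ANALYZE TABLE orc_stats_demo COMPUTE STATISTICS;

-- (5) Inspect again; per the report, rawDataSize now shows 0.
DESCRIBE EXTENDED orc_stats_demo;
```

Comparing the two DESCRIBE EXTENDED outputs is what shows the reported behavior;
the exact numbers will depend on the data in the source table.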



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
