hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amareshwari Sriramadasu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-3962) Number of distinct values are wrong in column statistics
Date Wed, 30 Jan 2013 10:05:13 GMT

     [ https://issues.apache.org/jira/browse/HIVE-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Amareshwari Sriramadasu updated HIVE-3962:
------------------------------------------

    Summary: Number of distinct values are wrong in column statistics  (was: number of distinct
values are in column statistics)
    
> Number of distinct values are wrong in column statistics
> --------------------------------------------------------
>
>                 Key: HIVE-3962
>                 URL: https://issues.apache.org/jira/browse/HIVE-3962
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 0.10.0
>            Reporter: Amareshwari Sriramadasu
>
> When we run the query on hive ql src table :
> select count(distinct(key)), count(distinct(value) from src;
> 309 309
> After running the following analyze query, the stats in metastore seem wrong:
> analyze table src compute statistics for columns key, value; 
> --- stats in metastore ---
> mysql > select * from TAB_COL_STATS where TABLE_NAME="src";
> | CS_ID | DB_NAME | TABLE_NAME | COLUMN_NAME | COLUMN_TYPE | TBL_ID | LONG_LOW_VALUE
| LONG_HIGH_VALUE | DOUBLE_HIGH_VALUE | DOUBLE_LOW_VALUE | BIG_DECIMAL_LOW_VALUE | BIG_DECIMAL_HIGH_VALUE
| NUM_NULLS | NUM_DISTINCTS | AVG_COL_LEN | MAX_COL_LEN | NUM_TRUES | NUM_FALSES | LAST_ANALYZED
|
> |     5 | default | src        | key         | int         |     11 |              0
|             498 |            0.0000 |           0.0000 | NULL                  | NULL  
                |         0 |           291 |      0.0000 |           0 |         0 |    
     0 |    1359539181 |
> |     6 | default | src        | value       | string      |     11 |              0
|               0 |            0.0000 |           0.0000 | NULL                  | NULL  
                |         0 |           112 |      6.8120 |           7 |         0 |    
     0 |    1359539181 |

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message