hive-issues mailing list archives

From "Lefty Leverenz (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-12411) Remove counter based stats collection mechanism
Date Wed, 25 Nov 2015 01:31:11 GMT

     [ https://issues.apache.org/jira/browse/HIVE-12411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lefty Leverenz updated HIVE-12411:
----------------------------------
    Labels: TODOC2.0  (was: )

> Remove counter based stats collection mechanism
> -----------------------------------------------
>
>                 Key: HIVE-12411
>                 URL: https://issues.apache.org/jira/browse/HIVE-12411
>             Project: Hive
>          Issue Type: Task
>          Components: Statistics
>    Affects Versions: 1.2.0, 1.2.1
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>              Labels: TODOC2.0
>             Fix For: 2.0.0
>
>         Attachments: HIVE-12411.01.patch, HIVE-12411.02.patch
>
>
> Following HIVE-12005 and HIVE-12164, which removed the JDBC and HBase stats collection
> mechanisms, we are now targeting the counter-based stats collection mechanism. The main
> reasons for removing it are as follows:
> (1) Counter-based stats limit the length of the counter name itself; if the name is too
> long, MD5 is applied to it.
> (2) When there are a large number of partitions and columns, a large number of counters
> must be created in memory, which puts a heavy load on the M/R AM, Tez AM, etc.
> FS-based stats will do a better job.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
