hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Szehon Ho (JIRA)" <>
Subject [jira] [Commented] (HIVE-6500) Stats collection via filesystem
Date Sun, 12 Oct 2014 22:43:33 GMT


Szehon Ho commented on HIVE-6500:

Oh, I see where you got it from, it is a regular expression.  Then I think even the paren's
is not needed, as it's part of the regex.  You can say jdbc:<database> and explain <database>
is derby, mysql, etc, it should be ok as an example.  Even I dont know the whole list, from
what I can tell the code uses <database> for some special logic if its derby, but not
anything else.  Hope that helps.

> Stats collection via filesystem
> -------------------------------
>                 Key: HIVE-6500
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>          Components: Statistics
>            Reporter: Ashutosh Chauhan
>            Assignee: Ashutosh Chauhan
>              Labels: TODOC13, TODOC14
>             Fix For: 0.13.0
>         Attachments: HIVE-6500.2.patch, HIVE-6500.3.patch, HIVE-6500.patch
> Recently, support for stats gathering via counter was [added |]
Although, its useful it has following issues:
> * [Length of counter group name is limited |]
> * [Length of counter name is limited |]
> * [Number of distinct counter groups are limited |]
> * [Number of distinct counters are limited |]
> Although, these limits are configurable, but setting them to higher value implies increased
memory load on AM and job history server.
> Now, whether these limits makes sense or not is [debatable |]
it is desirable that Hive doesn't make use of counters features of framework so that it we
can evolve this feature without relying on support from framework. Filesystem based counter
collection is a step in that direction.

This message was sent by Atlassian JIRA

View raw message