hive-issues mailing list archives

From Ádám Szita (Jira) <j...@apache.org>
Subject [jira] [Updated] (HIVE-22284) Improve LLAP CacheContentsTracker to collect and display correct statistics
Date Mon, 14 Oct 2019 15:16:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-22284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ádám Szita updated HIVE-22284:
------------------------------
    Attachment: HIVE-22284.7.patch

> Improve LLAP CacheContentsTracker to collect and display correct statistics
> ---------------------------------------------------------------------------
>
>                 Key: HIVE-22284
>                 URL: https://issues.apache.org/jira/browse/HIVE-22284
>             Project: Hive
>          Issue Type: Improvement
>          Components: llap
>            Reporter: Ádám Szita
>            Assignee: Ádám Szita
>            Priority: Major
>         Attachments: HIVE-22284.0.patch, HIVE-22284.1.patch, HIVE-22284.2.patch, HIVE-22284.3.patch, HIVE-22284.4.patch, HIVE-22284.5.patch, HIVE-22284.6.patch, HIVE-22284.7.patch
>
>
> When keeping track of which buffers correspond to which Hive objects, CacheContentsTracker relies on cache tags.
> Currently a tag is a simple String that ideally holds the DB name, the table name, and a partition spec, joined by '.' and '/'. This information is derived from the Path of the file that is being cached. This sometimes produces a wrong tag, especially for external tables.
> There is also a bug in calculating aggregated stats for a 'parent' tag (the one corresponding to the table of a partition): the overall maxCount and maxSize do not add up to the sum of those values in the partitions. This happens when buffers get removed from the cache.
>  
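For illustration only, here is a minimal Java sketch of one way a parent tag's maxCount can diverge from the sum of its partitions' maxCount values once buffers are removed. It is not the CacheContentsTracker code: the class TagStatsSketch, its Stats holder, and the cache/evict methods are hypothetical, and whether this is the exact mechanism behind the reported bug is an assumption. Only buffer counts are modelled; maxSize would behave the same way.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class TagStatsSketch {
    /** Hypothetical per-tag counters; not Hive's actual classes. */
    static final class Stats {
        long count;     // buffers currently cached under this tag
        long maxCount;  // highest value that count has ever reached

        void add() {
            count++;
            maxCount = Math.max(maxCount, count);
        }

        void remove() {
            count--;
        }
    }

    // One Stats per partition tag, plus one for the parent (table) tag.
    private final Map<String, Stats> partitions = new LinkedHashMap<>();
    private final Stats table = new Stats();

    void cache(String partition) {
        partitions.computeIfAbsent(partition, p -> new Stats()).add();
        table.add();
    }

    void evict(String partition) {
        partitions.get(partition).remove();
        table.remove();
    }

    long tableMaxCount() {
        return table.maxCount;
    }

    long sumOfPartitionMaxCounts() {
        return partitions.values().stream().mapToLong(s -> s.maxCount).sum();
    }

    public static void main(String[] args) {
        TagStatsSketch t = new TagStatsSketch();
        t.cache("p=1"); t.cache("p=1");   // partition p=1 peaks at 2 buffers
        t.evict("p=1"); t.evict("p=1");   // both buffers leave the cache
        t.cache("p=2"); t.cache("p=2");   // partition p=2 peaks at 2 buffers

        System.out.println(t.tableMaxCount());            // prints 2
        System.out.println(t.sumOfPartitionMaxCounts());  // prints 4
    }
}
```

In this sketch the table-level peak is tracked independently, so it records the largest number of buffers cached at the same time (2), while the partition peaks occur at different moments and sum to 4.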



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
