hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Dai (JIRA)" <>
Subject [jira] [Updated] (HIVE-16520) Cache hive metadata in metastore
Date Mon, 01 May 2017 20:56:04 GMT


Daniel Dai updated HIVE-16520:
       Resolution: Fixed
     Hadoop Flags: Reviewed
    Fix Version/s: 3.0.0
           Status: Resolved  (was: Patch Available)

Patch pushed to master. Thanks Vaibhav for contributing aggregate stats and cache refresh,
thanks Thejes for review!

To use CachedStore, please set hive.metastore.rawstore.impl to "org.apache.hadoop.hive.metastore.cache.CachedStore"
in hive-site.xml.

> Cache hive metadata in metastore
> --------------------------------
>                 Key: HIVE-16520
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>          Components: Metastore
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: 3.0.0
>         Attachments: HIVE-16520-1.patch, HIVE-16520.2.patch, HIVE-16520.3.patch, HIVE-16520.4.patch,
HIVE-16520-proto-2.patch, HIVE-16520-proto.patch
> During Hive 2 benchmark, we find Hive metastore operation take a lot of time and thus
slow down Hive compilation. In some extreme case, it takes much longer than the actual query
run time. Especially, we find the latency of cloud db is very high and 90% of total query
runtime is waiting for metastore SQL database operations. Based on this observation, the metastore
operation performance will be greatly enhanced if we have a memory structure which cache the
database query result.

This message was sent by Atlassian JIRA

View raw message