hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anja Gruenheid (JIRA)" <>
Subject [jira] Commented: (HIVE-1940) Query Optimization Using Column Metadata and Histograms
Date Thu, 03 Feb 2011 21:58:28 GMT


Anja Gruenheid commented on HIVE-1940:

I have set up the last stable version, but as far as I understood, some features have been
added during the current iteration, which also have had impact on the design of the MetaStore.
Is there an up-to-date overview of the MetaStore somewhere or should I retrace the updates
that have been made since the last release?

If I can collect all the data that I need, I'll create the model.

> Query Optimization Using Column Metadata and Histograms
> -------------------------------------------------------
>                 Key: HIVE-1940
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>          Components: Metastore, Query Processor
>            Reporter: Anja Gruenheid
> The current basis for cost-based query optimization in Hive is information gathered on
tables and partitions. To make further improvements in query optimization possible, the next
step is to develop and implement possibilities to gather information on columns as discussed
in issue HIVE-33. After that, an implementation of histograms is a possible option to use
and collect run-time statistics. Next to the actual implementation of these features, it is
also necessary to develop a consistent storage model for the MetaStore.

This message is automatically generated by JIRA.
For more information on JIRA, see:


View raw message