Return-Path: Delivered-To: apmail-hive-dev-archive@www.apache.org Received: (qmail 67372 invoked from network); 7 Feb 2011 19:42:23 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 7 Feb 2011 19:42:22 -0000 Received: (qmail 28691 invoked by uid 500); 7 Feb 2011 19:42:21 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 28189 invoked by uid 500); 7 Feb 2011 19:42:21 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 27866 invoked by uid 500); 7 Feb 2011 19:42:20 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 27847 invoked by uid 99); 7 Feb 2011 19:42:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Feb 2011 19:42:20 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Feb 2011 19:42:18 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 87E1118F407 for ; Mon, 7 Feb 2011 19:41:57 +0000 (UTC) Date: Mon, 7 Feb 2011 19:41:57 +0000 (UTC) From: "John Sichi (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: <227989142.418.1297107717553.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1673761671.4200.1296607948990.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Commented: (HIVE-1940) Query Optimization Using Column Metadata and Histograms MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HIVE-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12991542#comment-12991542 ] John Sichi commented on HIVE-1940: ---------------------------------- Awesome diagram! Can you add it as an attachment and check the radio button to grant license to ASF so that we can use it in the Hive wiki? Try loading some data into your partitions; maybe it deferred that part of the schema creation until then. There's a tool which can force generation of the entire schema: http://www.datanucleus.org/products/accessplatform/rdbms/schematool.html There's an ant target generate-schema which invokes it (in metastore/build.xml), but it's out-of-date because it still references jpox instead of datanucleus (e.g. it should be invoking org.datanucleus.store.rdbms.SchemaTool instead of org.jpox.SchemaTool). If you get it working, submit a patch and we can update it. > Query Optimization Using Column Metadata and Histograms > ------------------------------------------------------- > > Key: HIVE-1940 > URL: https://issues.apache.org/jira/browse/HIVE-1940 > Project: Hive > Issue Type: New Feature > Components: Metastore, Query Processor > Reporter: Anja Gruenheid > > The current basis for cost-based query optimization in Hive is information gathered on tables and partitions. To make further improvements in query optimization possible, the next step is to develop and implement possibilities to gather information on columns as discussed in issue HIVE-33. After that, an implementation of histograms is a possible option to use and collect run-time statistics. Next to the actual implementation of these features, it is also necessary to develop a consistent storage model for the MetaStore. -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira