Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 790C21109A for ; Wed, 13 Aug 2014 21:28:15 +0000 (UTC) Received: (qmail 60707 invoked by uid 500); 13 Aug 2014 21:28:15 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 60627 invoked by uid 500); 13 Aug 2014 21:28:15 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 60602 invoked by uid 500); 13 Aug 2014 21:28:15 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 60595 invoked by uid 99); 13 Aug 2014 21:28:15 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Aug 2014 21:28:14 +0000 Date: Wed, 13 Aug 2014 21:28:14 +0000 (UTC) From: "pengcheng xiong (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HIVE-7506) MetadataUpdater: provide a mechanism to edit the statistics of a column in a table (or a partition of a table) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-7506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7506: ---------------------------------- Attachment: HIVE-7506.8.patch Address QA's Operation2Privilege issue > MetadataUpdater: provide a mechanism to edit the statistics of a column in a table (or a partition of a table) > -------------------------------------------------------------------------------------------------------------- > > Key: HIVE-7506 > URL: https://issues.apache.org/jira/browse/HIVE-7506 > Project: Hive > Issue Type: New Feature > Components: Statistics > Reporter: pengcheng xiong > Assignee: pengcheng xiong > Priority: Minor > Attachments: HIVE-7506.1.patch, HIVE-7506.3.patch, HIVE-7506.4.patch, HIVE-7506.5.patch, HIVE-7506.6.patch, HIVE-7506.7.patch, HIVE-7506.8.patch, HIVE-7506.patch > > Original Estimate: 252h > Remaining Estimate: 252h > > Two motivations: > (1) Cost-based Optimizer (CBO) depends heavily on the statistics of a column in a table (or a partition of a table). If we would like to test whether CBO chooses the best plan under different statistics, it would be time consuming if we load the whole table and create the statistics from ground up. > (2) As database runs, the statistics of a column in a table (or a partition of a table) may change. We need a way or a mechanism to synchronize. > We propose the following command to achieve that: > ALTER TABLE table_name PARTITION partition_spec [COLUMN col_name] UPDATE STATISTICS col_statistics [COMMENT col_comment] -- This message was sent by Atlassian JIRA (v6.2#6252)