Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B263B90A8 for ; Fri, 16 Mar 2012 20:40:03 +0000 (UTC) Received: (qmail 8982 invoked by uid 500); 16 Mar 2012 20:40:03 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 8919 invoked by uid 500); 16 Mar 2012 20:40:03 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 8909 invoked by uid 500); 16 Mar 2012 20:40:03 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 8906 invoked by uid 99); 16 Mar 2012 20:40:03 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 Mar 2012 20:40:03 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 Mar 2012 20:40:00 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 1040024EB6 for ; Fri, 16 Mar 2012 20:39:40 +0000 (UTC) Date: Fri, 16 Mar 2012 20:39:40 +0000 (UTC) From: "Phabricator (Commented) (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: <784461567.26331.1331930380077.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <744829849.1672.1317166245929.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HIVE-2471) Add timestamp column to the partition stats table. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231603#comment-13231603 ] Phabricator commented on HIVE-2471: ----------------------------------- njain has commented on the revision "HIVE-2471 [jira] Add timestamp column with index to the partition stats table.". INLINE COMMENTS ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsSetupConstants.java:26 Write a big comment here that it is the users responsibility to delete the old table ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsUtils.java:128 I am not sure this will work - I am assuming this is invoked by StatsAggregator, but the data is inserted by StatsPublisher. The timestamp will be different in the 2 places REVISION DETAIL https://reviews.facebook.net/D2367 > Add timestamp column to the partition stats table. > -------------------------------------------------- > > Key: HIVE-2471 > URL: https://issues.apache.org/jira/browse/HIVE-2471 > Project: Hive > Issue Type: Improvement > Reporter: Kevin Wilfong > Assignee: Kevin Wilfong > Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch, HIVE-2471.D2367.2.patch > > > Occasionally, when entries are added to the partition stats table the program is halted before it can delete those entries, by an exception, keyboard interrupt, etc. These build up to the point where the table gets very large, and it hurts the performance of the update statement which is often called. In order to fix this, I am adding a column to the table which is auto-populated with the current timestamp. This will allow us to create scripts that go through periodically and clean out old entries from the table. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira