Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C2403EEC0 for ; Tue, 15 Jan 2013 18:46:13 +0000 (UTC) Received: (qmail 50078 invoked by uid 500); 15 Jan 2013 18:46:13 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 50025 invoked by uid 500); 15 Jan 2013 18:46:13 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 49983 invoked by uid 500); 15 Jan 2013 18:46:13 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 49955 invoked by uid 99); 15 Jan 2013 18:46:13 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Jan 2013 18:46:13 +0000 Date: Tue, 15 Jan 2013 18:46:13 +0000 (UTC) From: "Kevin Wilfong (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-3897) Add a way to get the uncompressed/compressed sizes of columns from an RC File MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554126#comment-13554126 ] Kevin Wilfong commented on HIVE-3897: ------------------------------------- https://cwiki.apache.org/confluence/display/Hive/RCFileCat > Add a way to get the uncompressed/compressed sizes of columns from an RC File > ----------------------------------------------------------------------------- > > Key: HIVE-3897 > URL: https://issues.apache.org/jira/browse/HIVE-3897 > Project: Hive > Issue Type: New Feature > Affects Versions: 0.11.0 > Reporter: Kevin Wilfong > Assignee: Kevin Wilfong > Fix For: 0.11.0 > > Attachments: HIVE-3897.1.patch.txt > > > The uncompressed, compressed size of each column of an RCFile is stored in the header of an RCFile block. Currently, we have no convenient way to get at this data. This would be useful for identifying where RCFile is doing a poor job of compression, so that we can better focus our efforts. > RCFileCat seems like a logical tool to extend to add this. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira