Date: Mon, 25 Apr 2011 09:22:05 +0000 (UTC)
From: "He Yongqiang (JIRA)"
To: hive-dev@hadoop.apache.org
Message-ID: <680198349.79605.1303723325774.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <227112398.77025.1303499225823.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (HIVE-2126) Hive's symlink text input format should be able to work with ComineHiveInputFormat

    [ https://issues.apache.org/jira/browse/HIVE-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13024760#comment-13024760 ]

He Yongqiang commented on HIVE-2126:
------------------------------------

review board: https://reviews.apache.org/r/653/

> Hive's symlink text input format should be able to work with ComineHiveInputFormat
> ----------------------------------------------------------------------------------
>
>                 Key: HIVE-2126
>                 URL: https://issues.apache.org/jira/browse/HIVE-2126
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: He Yongqiang
>            Assignee: He Yongqiang
>         Attachments: HIVE-2126.1.patch
>
>
> At compile time, if a partition's file format is SymlinkTextInputFormat, the symlink path is replaced with the paths listed in the symlink file. This way, it will work with Hive's CombineHiveInputFormat.
> We do this at compile time for two reasons:
> 1) At run time, the input path is used not only to get the record reader but also by Hive to look up the aliases and thus the operator tree. CombineHiveInputFormat can have multiple paths in each split, and when it switches paths it also sets the new input file name on the job. So it always requires a real input path name; the path cannot be faked.
> 2) Writing a new input format would duplicate a lot of the work already done in CombineHiveInputFormat.
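
The compile-time substitution described in the issue can be illustrated with a small sketch. This is not the code in HIVE-2126.1.patch. It is a minimal Python model that assumes a symlink directory holds plain text files listing one target path per line, and that the plan keeps a path-to-aliases map. The names resolve_symlink_inputs, read_symlink_targets, path_to_aliases and path_to_format are hypothetical and chosen only for this illustration.

```python
import os
import tempfile

# Hypothetical marker for partitions stored with the symlink text format.
SYMLINK_FORMAT = "SymlinkTextInputFormat"


def read_symlink_targets(symlink_dir):
    """Collect the target paths listed in every file under symlink_dir.

    Assumption: each symlink file is plain text with one path per line.
    """
    targets = []
    for name in sorted(os.listdir(symlink_dir)):
        file_path = os.path.join(symlink_dir, name)
        if not os.path.isfile(file_path):
            continue
        with open(file_path) as f:
            for line in f:
                line = line.strip()
                if line:
                    targets.append(line)
    return targets


def resolve_symlink_inputs(path_to_aliases, path_to_format):
    """Return a new path-to-aliases map in which every symlink-format path
    is replaced by the real paths it lists, each keeping the same aliases.
    """
    resolved = {}
    for path, aliases in path_to_aliases.items():
        if path_to_format.get(path) == SYMLINK_FORMAT:
            for target in read_symlink_targets(path):
                resolved.setdefault(target, []).extend(aliases)
        else:
            resolved.setdefault(path, []).extend(aliases)
    return resolved


def demo():
    """Build a throwaway symlink directory and resolve it."""
    with tempfile.TemporaryDirectory() as symlink_dir:
        with open(os.path.join(symlink_dir, "links.txt"), "w") as f:
            f.write("/data/a/part-0\n/data/b/part-1\n")
        return resolve_symlink_inputs(
            {symlink_dir: ["t1"], "/data/plain": ["t2"]},
            {symlink_dir: SYMLINK_FORMAT, "/data/plain": "TextInputFormat"},
        )


if __name__ == "__main__":
    print(demo())
```

Because the map handed to the run-time side contains only real paths, a combining input format can switch between the paths of a split and still find the aliases for whichever file it is currently reading, which is the property reason 1) depends on.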