Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4D77D10A1D for ; Fri, 15 Nov 2013 22:21:24 +0000 (UTC) Received: (qmail 49807 invoked by uid 500); 15 Nov 2013 22:21:21 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 49752 invoked by uid 500); 15 Nov 2013 22:21:21 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 49715 invoked by uid 500); 15 Nov 2013 22:21:21 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 49702 invoked by uid 99); 15 Nov 2013 22:21:21 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Nov 2013 22:21:21 +0000 Date: Fri, 15 Nov 2013 22:21:21 +0000 (UTC) From: "Shuaishuai Nie (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HIVE-5795) Hive should be able to skip header and footer rows when reading data file for a table MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shuaishuai Nie updated HIVE-5795: --------------------------------- Status: Patch Available (was: Open) > Hive should be able to skip header and footer rows when reading data file for a table > ------------------------------------------------------------------------------------- > > Key: HIVE-5795 > URL: https://issues.apache.org/jira/browse/HIVE-5795 > Project: Hive > Issue Type: Bug > Reporter: Shuaishuai Nie > Assignee: Shuaishuai Nie > Attachments: HIVE-5795.1.patch > > > Hive should be able to skip header and footer lines when reading data file from table. In this way, user don't need to processing data which generated by other application with a header or footer and directly use the file for table operations. > To implement this, the idea is adding new properties in table descriptions to define the number of lines in header and footer and skip them when reading the record from record reader. An DDL example for creating a table with header and footer should be like this: > {code} > Create external table testtable (name string, message string) row format delimited fields terminated by '\t' lines terminated by '\n' location '/testtable' tblproperties ("skip.header.number"="1", "skip.footer.number"="2"); > {code} -- This message was sent by Atlassian JIRA (v6.1#6144)