hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-5795) Hive should be able to skip header and footer rows when reading data file for a table
Date Wed, 01 Jan 2014 01:25:50 GMT

    [ https://issues.apache.org/jira/browse/HIVE-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13859817#comment-13859817
] 

Hive QA commented on HIVE-5795:
-------------------------------



{color:green}Overall{color}: +1 all checks pass

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12620989/HIVE-5795.5.patch

{color:green}SUCCESS:{color} +1 4818 tests passed

Test results: http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/781/testReport
Console output: http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/781/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12620989

> Hive should be able to skip header and footer rows when reading data file for a table
> -------------------------------------------------------------------------------------
>
>                 Key: HIVE-5795
>                 URL: https://issues.apache.org/jira/browse/HIVE-5795
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Shuaishuai Nie
>            Assignee: Shuaishuai Nie
>         Attachments: HIVE-5795.1.patch, HIVE-5795.2.patch, HIVE-5795.3.patch, HIVE-5795.4.patch,
HIVE-5795.5.patch
>
>
> Hive should be able to skip header and footer lines when reading data file from table.
In this way, user don't need to processing data which generated by other application with
a header or footer and directly use the file for table operations.
> To implement this, the idea is adding new properties in table descriptions to define
the number of lines in header and footer and skip them when reading the record from record
reader. An DDL example for creating a table with header and footer should be like this:
> {code}
> Create external table testtable (name string, message string) row format delimited fields
terminated by '\t' lines terminated by '\n' location '/testtable' tblproperties ("skip.header.line.count"="1",
"skip.footer.line.count"="2");
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message