hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jiraposter@reviews.apache.org (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2404) Allow RCFile Reader to tolerate corruptions
Date Thu, 25 Aug 2011 17:43:31 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13091163#comment-13091163
] 

jiraposter@reviews.apache.org commented on HIVE-2404:
-----------------------------------------------------


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1660/
-----------------------------------------------------------

Review request for Yongqiang He and Paul Yang.


Summary
-------

Sometimes it is useful to tolerate corruptions during a query and return results based on
the files that can be processed. A single corrupt block of data should not prevent reading
the rest of the data.

We need a way to gracefully ignore errors while reading a RC File


This addresses bug HIVE-2404.
    https://issues.apache.org/jira/browse/HIVE-2404


Diffs
-----

  trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFile.java 1161660 

Diff: https://reviews.apache.org/r/1660/diff


Testing
-------

Manual testing with a corrupt RC file


Thanks,

Ramkumar



> Allow RCFile Reader to tolerate corruptions
> -------------------------------------------
>
>                 Key: HIVE-2404
>                 URL: https://issues.apache.org/jira/browse/HIVE-2404
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.7.1
>            Reporter: Ramkumar Vadali
>            Assignee: Ramkumar Vadali
>            Priority: Minor
>         Attachments: toleratecorruptions.patch
>
>
> Sometimes it is useful to tolerate corruptions during a query and return results based
on the files that can be processed. A single corrupt block of data should not prevent reading
the rest of the data.
> We need a way to gracefully ignore errors while reading a RC File

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message