orc-dev mailing list archives

From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ORC-162) Handle 0 byte files as empty ORC files
Date Thu, 16 Mar 2017 15:49:41 GMT
Owen O'Malley created ORC-162:

             Summary: Handle 0 byte files as empty ORC files
                 Key: ORC-162
                 URL: https://issues.apache.org/jira/browse/ORC-162
             Project: ORC
          Issue Type: Bug
            Reporter: Owen O'Malley
            Assignee: Owen O'Malley

Hive often creates empty files for empty buckets, which can put significant load on
the HDFS cluster. The Hive OrcOutputFormat and OrcInputFormat were therefore changed to
handle 0 byte files as a special case of an empty ORC file.

We need to make the other ORC readers treat 0 byte files reasonably as well, that is, as empty ORC files.
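
As a rough illustration of the behavior being asked for, here is a minimal sketch in Python. It is not the ORC reader code: the function name classify_file and its return values are hypothetical, chosen only for this example. The one format detail it relies on is that a regular ORC file begins with the three magic bytes "ORC"; a 0 byte file has no header at all, so it has to be recognized by its length before any parsing is attempted.

```python
import os
import tempfile

ORC_MAGIC = b"ORC"  # a regular ORC file starts with these three bytes


def classify_file(path):
    """Hypothetical helper: decide how a reader might treat a file.

    Returns "empty" for a 0 byte file (treated as an ORC file with no rows),
    "orc" if the file starts with the ORC magic bytes, and "invalid" otherwise.
    """
    # Check the length first: a 0 byte file has no header or footer to parse.
    if os.path.getsize(path) == 0:
        return "empty"
    with open(path, "rb") as handle:
        header = handle.read(len(ORC_MAGIC))
    return "orc" if header == ORC_MAGIC else "invalid"


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        empty_path = os.path.join(tmp, "bucket_00000")
        open(empty_path, "wb").close()  # create a 0 byte file
        print(classify_file(empty_path))
```

A reader built along these lines would report zero rows for the "empty" case instead of failing while trying to read a footer that does not exist.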

This message was sent by Atlassian JIRA
