hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron Dossett (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-11977) Hive should handle an external avro table with zero length files present
Date Mon, 28 Sep 2015 18:26:04 GMT
Aaron Dossett created HIVE-11977:
------------------------------------

             Summary: Hive should handle an external avro table with zero length files present
                 Key: HIVE-11977
                 URL: https://issues.apache.org/jira/browse/HIVE-11977
             Project: Hive
          Issue Type: Bug
            Reporter: Aaron Dossett


If a zero length file is in the top level directory housing an external avro table,  all hive
queries on the table fail.

This issue is that org.apache.hadoop.hive.ql.io.avro.AvroGenericRecordReader creates a new
org.apache.avro.file.DataFileReader and DataFileReader throws an exception when trying to
read an empty file (because the empty file lacks the magic number marking it as avro).  

AvroGenericRecordReader should detect an empty file and then behave reasonably.

Caused by: java.io.IOException: Not a data file.
at org.apache.avro.file.DataFileStream.initialize(DataFileStream.java:102)
at org.apache.avro.file.DataFileReader.<init>(DataFileReader.java:97)
at org.apache.hadoop.hive.ql.io.avro.AvroGenericRecordReader.<init>(AvroGenericRecordReader.java:81)
at org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat.getRecordReader(AvroContainerInputFormat.java:51)
at org.apache.hadoop.hive.ql.io.HiveInputFormat.getRecordReader(HiveInputFormat.java:246)
... 25 more



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message