TextInputFormat should not create input splits for 0 byte files
---------------------------------------------------------------
Key: HADOOP-2952
URL: https://issues.apache.org/jira/browse/HADOOP-2952
Project: Hadoop Core
Issue Type: Improvement
Components: mapred
Reporter: Owen O'Malley
As part of HADOOP-2027, I discovered that we create input splits for 0-byte files. (In theory
this applies to both sequence files and text files, but in practice a sequence file can't be
0 bytes, since it always contains a header.)
I think 0-byte files can and should be dropped when computing splits, since they contain no input to process.
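
As a rough illustration of the proposed behavior, here is a minimal, self-contained sketch in plain Java. It is not the actual Hadoop code or a patch: the class ZeroByteSplitSketch, the Split type, and the computeSplits method are hypothetical names invented for this example, and the fixed-size splitting is a simplification of what the real input format does. The only point it shows is that a file of length 0 contributes no splits.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class ZeroByteSplitSketch {

    /** Hypothetical stand-in for an input split: a byte range within one file. */
    static final class Split {
        final String path;
        final long start;
        final long length;

        Split(String path, long start, long length) {
            this.path = path;
            this.start = start;
            this.length = length;
        }
    }

    /**
     * Builds fixed-size splits for each file, skipping files of length 0.
     * fileLengths maps a file name to its length in bytes (assumed input shape).
     */
    static List<Split> computeSplits(Map<String, Long> fileLengths, long splitSize) {
        if (splitSize <= 0) {
            throw new IllegalArgumentException("splitSize must be positive");
        }
        List<Split> splits = new ArrayList<>();
        for (Map.Entry<String, Long> entry : fileLengths.entrySet()) {
            long length = entry.getValue();
            if (length == 0) {
                // Proposed change: an empty file has nothing to process, so emit no split.
                continue;
            }
            for (long offset = 0; offset < length; offset += splitSize) {
                long size = Math.min(splitSize, length - offset);
                splits.add(new Split(entry.getKey(), offset, size));
            }
        }
        return splits;
    }

    public static void main(String[] args) {
        Map<String, Long> files = new LinkedHashMap<>();
        files.put("part-00000", 0L);   // empty file: should yield no split
        files.put("part-00001", 150L); // non-empty file: split into ranges
        for (Split s : computeSplits(files, 100L)) {
            System.out.println(s.path + " start=" + s.start + " length=" + s.length);
        }
    }
}
```

In a real job, skipping these files would also avoid scheduling map tasks that have no records to read.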