hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Allen Wittenauer (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-340) TextInputFormat should not create input splits for 0 byte files
Date Thu, 17 Jul 2014 21:30:05 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Allen Wittenauer updated MAPREDUCE-340:
---------------------------------------

    Labels: newbie  (was: )

> TextInputFormat should not create input splits for 0 byte files
> ---------------------------------------------------------------
>
>                 Key: MAPREDUCE-340
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-340
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Owen O'Malley
>              Labels: newbie
>
> As part of HADOOP-2027, I discovered that we create input splits for 0 byte files. (In
theory this is for both sequence file and text files, but in practice sequence files can't
be 0 bytes.) I think 0 byte files can and should be dropped, since they have no input to process.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message