hadoop-mapreduce-issues mailing list archives

From "Greg Roelofs (JIRA)" <j...@apache.org>
Subject [jira] Created: (MAPREDUCE-1795) add error option if file-based record-readers fail to consume all input (e.g., concatenated gzip, bzip2)
Date Tue, 18 May 2010 01:20:43 GMT
add error option if file-based record-readers fail to consume all input (e.g., concatenated
gzip, bzip2)
--------------------------------------------------------------------------------------------------------

                 Key: MAPREDUCE-1795
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1795
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
            Reporter: Greg Roelofs
            Assignee: Ravi Gummadi


When running MapReduce with concatenated gzip files as input, only the first part is read,
which is confusing, to say the least. Concatenated gzip is described in http://www.gnu.org/software/gzip/manual/gzip.html#Advanced-usage
and in http://www.ietf.org/rfc/rfc1952.txt. (See the original report at http://www.nabble.com/Problem-with-Hadoop-and-concatenated-gzip-files-to21383097.html)
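
To illustrate the behaviour described above, here is a minimal sketch in Python (not Hadoop code). It assumes a decoder that stops after the first gzip member, and shows how an optional check for unconsumed input could turn the silent truncation into an error. The names read_first_member, fail_on_leftover and UnconsumedInputError are hypothetical and exist only for this example.

```python
import gzip
import zlib


class UnconsumedInputError(Exception):
    """Hypothetical error: input bytes remained after the decoder stopped."""


def read_first_member(data: bytes, fail_on_leftover: bool = False) -> bytes:
    """Decode only the first gzip member, mimicking the reported behaviour.

    If fail_on_leftover is set, raise instead of silently ignoring
    whatever follows the first member.
    """
    # 16 + MAX_WBITS tells zlib to expect a gzip header and trailer.
    decoder = zlib.decompressobj(16 + zlib.MAX_WBITS)
    payload = decoder.decompress(data) + decoder.flush()
    # Bytes past the end of the first member end up in unused_data.
    leftover = decoder.unused_data
    if fail_on_leftover and leftover:
        raise UnconsumedInputError(
            f"{len(leftover)} bytes of input were not consumed"
        )
    return payload


# Two independently compressed members joined byte-for-byte,
# equivalent to `cat a.gz b.gz > both.gz`.
concatenated = gzip.compress(b"first\n") + gzip.compress(b"second\n")

truncated = read_first_member(concatenated)   # only the first member
complete = gzip.decompress(concatenated)      # handles all members
```

In this sketch, truncated holds only the data of the first member, while gzip.decompress returns the data of both. Calling read_first_member(concatenated, fail_on_leftover=True) raises instead of returning partial data, which is the kind of error option the issue title asks for.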


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

