hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andy Isaacson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
Date Tue, 23 Oct 2012 00:44:12 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13481990#comment-13481990
] 

Andy Isaacson commented on HADOOP-8900:
---------------------------------------

bq. Andy or Colin, can you please review the merged branch-1 patch.

hadoop-8900.branch-1.patch looks good to me.  Thanks for the backport!
                
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed
size
> ---------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-8900
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8900
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 1-win, 2.0.1-alpha
>         Environment: Encountered failure when processing large GZIP file
>            Reporter: Slavik Krassovsky
>            Assignee: Andy Isaacson
>             Fix For: 3.0.0, 2.0.3-alpha
>
>         Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch,
hadoop8900.txt
>
>
> Encountered failure when processing large GZIP file
> • Gz: Failed in 1hrs, 13mins, 57sec with the error:
>  ¸java.io.IOException: IO error in map input file hdfs://localhost:9000/Halo4/json_m/gz/NewFileCat.txt.gz
>  at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:242)
>  at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:216)
>  at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
>  at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:435)
>  at org.apache.hadoop.mapred.MapTask.run(MapTask.java:371)
>  at org.apache.hadoop.mapred.Child$4.run(Child.java:266)
>  at java.security.AccessController.doPrivileged(Native Method)
>  at javax.security.auth.Subject.doAs(Subject.java:415)
>  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
>  at org.apache.hadoop.mapred.Child.main(Child.java:260)
>  Caused by: java.io.IOException: stored gzip size doesn't match decompressed size
>  at org.apache.hadoop.io.compress.zlib.BuiltInGzipDecompressor.executeTrailerState(BuiltInGzipDecompressor.java:389)
>  at org.apache.hadoop.io.compress.zlib.BuiltInGzipDecompressor.decompress(BuiltInGzipDecompressor.java:224)
>  at org.apache.hadoop.io.compress.DecompressorStream.decompress(DecompressorStream.java:82)
>  at org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:76)
>  at java.io.InputStream.read(InputStream.java:102)
>  at org.apache.hadoop.util.LineReader.readLine(LineReader.java:134)
>  at org.apache.hadoop.mapred.LineRecordReader.next(LineRecordReader.java:136)
>  at org.apache.hadoop.mapred.LineRecordReader.next(LineRecordReader.java:40)
>  at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:66)
>  at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:32)
>  at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:67)
>  at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:236)
>  ... 9 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message