hadoop-common-user mailing list archives

From edward choi <mp2...@gmail.com>
Subject How to read LZO compressed files?
Date Mon, 02 Jan 2012 05:34:24 GMT

I'm having trouble reading LZO-compressed files.
The input files were compressed with the LzopCodec provided by the hadoop-lzo
package, and I am using the Cloudera CDH3 update 2 version of Hadoop.

I don't need to split the input files, so please don't tell me to index them
and use LzoTextInputFormat, unless that is the only way to handle
LZO-compressed files.

I thought all I needed to do was set the job's input format to
"TextInputFormat" and Hadoop would take care of the rest.
When I do that, I don't get any error messages, but the log files show that
the input files are not being decompressed at all. They are being processed
as raw text files.

Is there a specific way to read files with the .lzo extension?
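
For context, TextInputFormat picks a decompressor by matching the file
extension against the codec classes listed in the io.compression.codecs
property, and reads the file as plain text when no codec matches. One possible
explanation for the behavior described above, offered only as an assumption
since the cluster configuration is not shown, is that the hadoop-lzo codecs
are not registered there. A minimal sketch of the relevant core-site.xml
entries, following the hadoop-lzo README, might look like this (the exact
list of other codecs is an assumption and should match the existing cluster
setup):

```xml
<!-- Sketch only: register the hadoop-lzo codecs so that files ending in
     .lzo can be matched to LzopCodec. Assumes the hadoop-lzo jar and its
     native libraries are already installed on every node. -->
<property>
  <name>io.compression.codecs</name>
  <value>org.apache.hadoop.io.compress.GzipCodec,org.apache.hadoop.io.compress.DefaultCodec,org.apache.hadoop.io.compress.BZip2Codec,com.hadoop.compression.lzo.LzoCodec,com.hadoop.compression.lzo.LzopCodec</value>
</property>
<property>
  <name>io.compression.codec.lzo.class</name>
  <value>com.hadoop.compression.lzo.LzoCodec</value>
</property>
```

With the codecs registered, an unindexed .lzo file should be read by plain
TextInputFormat as a single, non-splittable input, which would fit the
no-splitting requirement stated above.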

