hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Harsh J <ha...@cloudera.com>
Subject Re: How to find compression codec
Date Tue, 26 Jun 2012 06:04:33 GMT
The codec classname is serialized into a sequence file itself.

You can detect the codec of a SequenceFile, using its Reader:
http://hadoop.apache.org/common/docs/r0.20.2/api/org/apache/hadoop/io/SequenceFile.Reader.html#getCompressionCodec()

Or non-programmatically by viewing the header of the file (cat first
hundred bytes or so), and finding the pattern
"org.apache.hadoop.io.compress" in the stream printed.

On Tue, Jun 26, 2012 at 3:55 AM, Mohit Anchlia <mohitanchlia@gmail.com> wrote:
> Is there a way to look at the sequence file or a block report to see which
> compression is being used?



-- 
Harsh J

Mime
View raw message