hive-user mailing list archives

From Sachin Sudarshana <sachin.had...@gmail.com>
Subject Compression in Hive using different file formats
Date Sun, 09 Jun 2013 09:29:33 GMT
Hi,

I was testing Compression in Hive using different file formats.

I have a table stored as a sequence file, *facts_normal_seq*.

Now I wish to create another table, *facts_snappy_seq*, using the Snappy
compression codec.

Is this the correct way to do this:

CREATE TABLE facts_snappy_seq (<column1> , <column2> ....) ROW FORMAT....
STORED AS SEQUENCEFILE;

SET hive.exec.compress.output=true;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
SET mapred.output.compression.type=BLOCK;

INSERT OVERWRITE TABLE facts_snappy_seq SELECT * FROM facts_normal_seq;

When I populate the table in this manner, the file in HDFS does not seem
to have the .snappy extension.

Any pointers in this regard would be really helpful.

Thank you,
Sachin
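
A minimal sketch of how the result could be checked from the Hive CLI,
assuming the same session and table names as above. `<table_location>` is
a placeholder for the path that DESCRIBE FORMATTED reports, not a real
path. To my understanding, a SequenceFile records its codec in the file
header, not in the file name, so a missing .snappy extension does not by
itself show that compression was skipped.

```sql
-- Print the current session values to confirm the settings are in effect.
SET hive.exec.compress.output;
SET mapred.output.compression.codec;
SET mapred.output.compression.type;

-- Show the table metadata; the HDFS path is listed under "Location:".
DESCRIBE FORMATTED facts_snappy_seq;

-- List the output files and their sizes.
-- <table_location> is a placeholder for the path reported above.
dfs -ls <table_location>;
```

Comparing the listed file sizes with those of facts_normal_seq gives a
rough indication of whether compression took effect. The first bytes of
an output file should also contain the codec class name if the file was
written compressed.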
