hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ravi Mummulla (BIG DATA)" <rav...@microsoft.com>
Subject RE: Compression in Hive
Date Mon, 10 Jun 2013 13:14:17 GMT
Documentation is here https://cwiki.apache.org/confluence/display/Hive/CompressedStorage. Performance
overhead is trivial for larger amounts of data but may be magnified as data size gets smaller.
Typically where you gain is data transfers between nodes and disk reads/writes. Again, the
larger the data size the more the gain.

Thanks.

From: Sachin Sudarshana [mailto:sachin.hadoop@gmail.com]
Sent: Sunday, June 9, 2013 11:04 PM
To: user@hive.apache.org
Subject: Compression in Hive

Hi,

I have been testing the usefulness of compression in Hive. I have a general question,

I would like to know if there are any particular cases where compression in hive can actually
prove useful while running any MR jobs.

Any pointers/examples would really be useful!

Thank you,
Sachin


Mime
View raw message