hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ravi Mummulla (BIG DATA)" <>
Subject RE: Compression in Hive
Date Mon, 10 Jun 2013 13:14:17 GMT
Documentation is here Performance
overhead is trivial for larger amounts of data but may be magnified as data size gets smaller.
Typically where you gain is data transfers between nodes and disk reads/writes. Again, the
larger the data size the more the gain.


From: Sachin Sudarshana []
Sent: Sunday, June 9, 2013 11:04 PM
Subject: Compression in Hive


I have been testing the usefulness of compression in Hive. I have a general question,

I would like to know if there are any particular cases where compression in hive can actually
prove useful while running any MR jobs.

Any pointers/examples would really be useful!

Thank you,

View raw message