hadoop-common-user mailing list archives

From Sugandha Naolekar <sugandha....@gmail.com>
Subject Compression issues!!
Date Wed, 15 Jul 2009 05:39:57 GMT

A few days ago, I asked about compressing data stored in Hadoop. I got
helpful replies, such as:

Place the data in HDFS first and then compress it, so that the data is
stored in sequence files.

My question, however, is this: I want to compress the data before placing
it in HDFS, so that redundancy doesn't come into the picture.

How can I do that? Also, will I have to use an external compression
algorithm, or will the APIs alone serve the purpose?
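
To make the question concrete, here is a minimal sketch of one possible
approach, assuming the data starts out as an ordinary file on the client
machine: gzip it locally with a standard library, then upload the .gz file.
The helper name gzip_file and the paths in the comments are hypothetical,
chosen only for illustration. Hadoop also ships its own codec classes under
org.apache.hadoop.io.compress (for example GzipCodec), which is the API
route; this sketch shows only the "compress outside Hadoop first" route.

```python
import gzip
import shutil
from pathlib import Path
from typing import Optional


def gzip_file(src: Path, dst: Optional[Path] = None) -> Path:
    """Compress src into a gzip file on the local disk and return its path.

    Hypothetical helper: by default the output sits next to the input
    with ".gz" appended to the file name.
    """
    src = Path(src)
    dst = Path(dst) if dst is not None else src.with_name(src.name + ".gz")
    with open(src, "rb") as fin, gzip.open(dst, "wb") as fout:
        # Copy in chunks so large files do not have to fit in memory.
        shutil.copyfileobj(fin, fout)
    return dst


# Example (paths are placeholders, not taken from the original message):
#   compressed = gzip_file(Path("data.log"))
#   then upload the result from a shell:
#   hadoop fs -put data.log.gz /some/hdfs/dir
```

Hadoop picks a codec from the file extension, so a .gz file uploaded this
way can still be read by jobs that use the standard text input format. The
trade-off is that a gzip file cannot be split, so each such file is
processed by a single map task.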

