hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From r K <apcemailingl...@gmail.com>
Subject Compression Problem
Date Sun, 30 Mar 2014 21:52:38 GMT
Hello Everyone,

I'm new to hadoop and tried to compress few files, using a streaming job.
Used streaming job mentioned in this post

Used bzip format instead. After they compressed, because there were too
many small files, I did

hadoop fs -cat /input/* | hadoop fs -put - /output/day1

Tried to build an external table from this day1 file and i get weird
characters.Renamed the file to day1.bz and then tried to rebuild table but
still get weird characters.

Realized I messed up pretty bad. Is there anyway to salvage this data ?
Is there anyway to use this compressed file /output/day1 at all ?

Thanks in advance.

View raw message