hadoop-mapreduce-user mailing list archives

From Azuryy Yu <azury...@gmail.com>
Subject Re: Best format to use
Date Tue, 09 Apr 2013 01:16:07 GMT
Impala can work with compressed files, but they have to be sequence files,
not text files that were compressed directly.
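
To illustrate the distinction, here is a toy Python sketch. It is not the real SequenceFile on-disk format and uses no Hadoop or Impala API; the function names (compress_whole, write_blocks, read_block) and the length-prefixed layout are made up for illustration. It only contrasts compressing a whole text file as one stream with compressing groups of records independently inside a container, so that a reader can decode one block without touching the rest.

```python
import struct
import zlib


def compress_whole(lines):
    """Compress the entire log as a single deflate stream (a directly compressed text file)."""
    return zlib.compress("\n".join(lines).encode("utf-8"))


def write_blocks(lines, block_size=2):
    """Toy container (hypothetical layout): each group of records is compressed
    independently and stored with a 4-byte big-endian length prefix."""
    out = bytearray()
    for i in range(0, len(lines), block_size):
        payload = zlib.compress("\n".join(lines[i:i + block_size]).encode("utf-8"))
        out += struct.pack(">I", len(payload)) + payload
    return bytes(out)


def read_block(data, index):
    """Walk the length prefixes to block `index` and decompress only that block."""
    offset = 0
    for _ in range(index):
        (length,) = struct.unpack_from(">I", data, offset)
        offset += 4 + length
    (length,) = struct.unpack_from(">I", data, offset)
    payload = data[offset + 4:offset + 4 + length]
    return zlib.decompress(payload).decode("utf-8").split("\n")


# Made-up sample records, for illustration only.
logs = ["GET /a 200", "GET /b 404", "POST /c 201", "GET /d 200", "GET /e 500"]
container = write_blocks(logs)
print(read_block(container, 1))
```

With the single-stream version, a reader has to decompress from the first byte to reach any record; with the container, each block stands alone.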

On Tue, Apr 9, 2013 at 7:48 AM, Mark <static.void.dev@gmail.com> wrote:

> Trying to determine the best format to use for storing daily logs. We
> recently switched from snappy (.snappy) to gzip (.deflate), but I'm wondering
> if there is something better. Our main clients for these daily logs are Pig
> and Hive using an external table. We were thinking about testing out Impala,
> but we see that it doesn't work with compressed text files. Any suggestions?
> Thanks
