hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Serge Blazhievsky <Serge.Blazhiyevs...@nice.com>
Subject Re: Convergence on File Format?
Date Thu, 08 Mar 2012 23:10:04 GMT
We started using Avro few month ago and results are great!

Easy to use, reliable, feature rich, great integration with MapReduce

On 3/8/12 3:07 PM, "Michal Klos" <mklos@compete.com> wrote:

>It seems that  Avro is poised to become "the" file format, is that still
>the case?
>We've looked at Text, RCFile and Avro. Text is nice, but we'd really need
>to extend it. RCFile is great for Hive, but it has been a challenge using
>it outside of Hive. Avro has a great feature set, but is comparably (to
>RCFile) significantly slower and larger on disk in our testing, but if it
>has the highest rate of development, it may be the right choice.
>If you were choosing a File Format today to build a general purpose
>cluster (general purpose in the sense of using all the Hadoop tools, not
>just Hive), what would you choose? (one of the choices being development
>of a Custom format)

View raw message