hive-user mailing list archives

From Shushant Arora <shushantaror...@gmail.com>
Subject writing in parquet hive table using custom MR
Date Sat, 14 Nov 2015 14:51:58 GMT
Hi

I need to write Parquet files into a Hive table using a custom MapReduce job.

Parquet has several data models: avro-parquet, proto-parquet, hive-parquet.
Which one is recommended for in-memory plain Java objects?

Hive internally uses MapredParquetOutputFormat. Is it better than
AvroParquetOutputFormat?
I didn't find any sample program using MapredParquetOutputFormat. What are
the pros and cons of MapredParquetOutputFormat compared with
AvroParquetOutputFormat?
Why does ParquetOutputFormat need a data model to convert Java objects to
Parquet records?
Why can't it, like OrcOutputFormat, use a SerDe and convert any Writable to
a Parquet record?

Thanks
