arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sourav Mazumder <sourav.mazumde...@gmail.com>
Subject Comparing with Parquet
Date Thu, 25 Feb 2016 16:10:22 GMT
Hi All,

New to this. And still trying to figure out where exactly Arrow fits in the
ecosystem of various Big Data technologies.

In that respect first thing which came to my mind is how does Arrow compare
with parquet.

In my understanding Parquet also supports a very efficient columnar format
(with support for nested structure). It is already embraced (supported) by
various technologies like Impala (origin), Spark, Drill etc.

The only think I see missing in Parquet is support for SIMD based
vectorized operations.

Am I right or am I missing many other differences between Arrow and parquet
?

Regards,
Sourav

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message