arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Wes McKinney <>
Subject [ANNOUNCE] Apache Arrow 0.2.0 released
Date Sun, 19 Feb 2017 16:46:20 GMT
The Apache Arrow community is pleased to announce the 0.2.0
release. It includes 192 resolved issues ([1]) since the first
ASF release on October 7, 2016.

The released source artifacts are located at [2]. Maven, conda,
and other artifacts will be published in the near future.

What is Apache Arrow?

Apache Arrow is a columnar in-memory analytics layer designed to accelerate big
data. It houses a set of canonical in-memory representations of flat and
hierarchical data along with multiple language-bindings for structure
manipulation. It also provides low-overhead streaming and batch messaging,
zero-copy interprocess communication (IPC), and common algorithm

Release Highlights

This release is a major milestone for the project, as we now have
integration tests validating binary compatibility between the
Java and C++ (and Python) implementations. These tests are now
being run continuously in Travis CI.

Other highlights include:

- A new streaming binary format (with Java and C++/Python implementations)
- Prototype for dictionary-encoded data in memory
- Significantly expanded Python functionality, particularly pandas and Apache
  Parquet interoperability
- A JSON file "format" for specifying integration tests
- Expanded zero-copy or low-overhead threadsafe IO for C++
- Build and packaging improvements

Please report any feedback to the mailing lists ([3])

The Apache Arrow community


View raw message