arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Wes McKinney <wesmck...@gmail.com>
Subject [ANNOUNCE] Apache Arrow 0.2.0 released
Date Sun, 19 Feb 2017 16:46:20 GMT
The Apache Arrow community is pleased to announce the 0.2.0
release. It includes 192 resolved issues ([1]) since the first
ASF release on October 7, 2016.

The released source artifacts are located at [2]. Maven, conda,
and other artifacts will be published in the near future.

What is Apache Arrow?
---------------------

Apache Arrow is a columnar in-memory analytics layer designed to accelerate big
data. It houses a set of canonical in-memory representations of flat and
hierarchical data along with multiple language-bindings for structure
manipulation. It also provides low-overhead streaming and batch messaging,
zero-copy interprocess communication (IPC), and common algorithm
implementations.

Release Highlights
------------------

This release is a major milestone for the project, as we now have
integration tests validating binary compatibility between the
Java and C++ (and Python) implementations. These tests are now
being run continuously in Travis CI.

Other highlights include:

- A new streaming binary format (with Java and C++/Python implementations)
- Prototype for dictionary-encoded data in memory
- Significantly expanded Python functionality, particularly pandas and Apache
  Parquet interoperability
- A JSON file "format" for specifying integration tests
- Expanded zero-copy or low-overhead threadsafe IO for C++
- Build and packaging improvements

Please report any feedback to the mailing lists ([3])

Regards,
The Apache Arrow community

[1]: https://issues.apache.org/jira/issues/?jql=project%20%3D%20ARROW%20AND%20fixVersion%20%3D%200.2.0%20ORDER%20BY%20priority%20DESC
[2]: https://dist.apache.org/repos/dist/release/arrow/
[3]: https://lists.apache.org/list.html?dev@arrow.apache.org

Mime
View raw message