www-announce mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sally Khudairi ...@apache.org>
Subject The Apache® Software Foundation Announces Apache Arrow™ Momentum
Date Tue, 19 Feb 2019 11:00:50 GMT
[this announcement is available online at https://s.apache.org/zhaP ]

Open Source Big Data in-memory columnar layer adopted by dozens of Open Source and commercial
technologies; exceeded 1,000,000 monthly downloads within first three years as an Apache Top-Level
Project

Wakefield, MA —19 February 2019— The Apache Software Foundation (ASF), the all-volunteer
developers, stewards, and incubators of more than 350 Open Source projects and initiatives,
today announced momentum with Apache® Arrow™, the Open Source Big Data in-memory columnar
layer.

Since the founding of the project in January 2016, Apache Arrow has quickly become the defacto
standard for representing and processing analytical data in memory, accelerating analytical
processing and interchange by more than 100x.

"When we became a Top-Level Project, we projected that the majority of the world's data will
be processed through Arrow within the next decade," said Jacques Nadeau, Vice President of
Apache Arrow. "In just three years time, we are proud to see Arrow's substantial industry
adoption and increased value across a wide range of analytical, machine learning, and artificial
intelligence workloads."

Highlights of Apache Arrow's success include:

- Industry Adoption —more than 20 major technologies adopted Arrow to accelerate in-memory
analytics, including Apache Spark, NVIDIA RAPIDS, pandas, and Dremio, among others. A list
of known Open Source and commercial implementations can be found at https://arrow.apache.org/powered_by/

- Millions of Downloads —leveraging and integrating Apache Arrow into many other technologies
has bolstered downloads to more than 1,000,000 each month.

- New Language Support —as a cross-language development platform, supporting multiple programming
languages is paramount. Apache Arrow has grown from supporting one language to eleven different
languages today; they include C++, Java, Python, R, C#, Javascript, and Ruby, among others.

- Seamless Data Format Support —Arrow supports different data types, both simple and nested,
located in arbitrary memory such as regular system RAM, memory-mapped files or on-GPU memory.
In addition, it can ingest data from popular storage formats such as Apache Parquet, CSV files,
Apache ORC, JSON, and more.

- Major Code Donations —Apache Arrow's new features and expanded functionality are due in
part to code and component donations that include:
  -- C# Library
  -- Gandiva LLVM-based Expression Compiler
  -- Go Library
  -- Javascript Library
  -- Plasma Shared Memory Object Store
  -- Ruby Libraries (Apache Arrow and Apache Parquet)
  -- Rust Libraries (Parquet and DataFusion Query Engine)

- Community and Contributor Growth —over the past 12 months, nearly 300 individuals have
submitted more than 3,000 contributions that have grown the Apache Arrow code base by 300,000
lines of code. The Arrow community is welcoming approximately 10 new contributors each month.


In January the project announced its most recent release, Apache Arrow 0.12.0, which reflects
more than 600 enhancements developed during Q4 2018. The Apache Arrow community is actively
working on a number of impactful new initiatives that include solving high performance analytical
problems and allowing for more efficient data distribution across entire clusters.

"Apache Arrow's rapid industry adoption and developer community growth supports our original
thesis of the importance of a language-independent open standard for columnar data," said
Wes McKinney, member of the Apache Arrow Project Management Committee, and creator of Python's
pandas project. "Additionally, we are seeing productive collaborations take place not only
between programming languages but also between the database systems and data science worlds.
We look forward to welcoming more data system developers into our community."

About Apache Arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized
language-independent columnar memory format for flat and hierarchical data, organized for
efficient analytic operations on modern hardware. It also provides computational libraries
and zero-copy streaming messaging and interprocess communication. Languages currently supported
include C, C++, C#, Go, Java, JavaScript, MATLAB, Python, R, Ruby, and Rust.

Availability and Oversight
Apache Arrow software is released under the Apache License v2.0 and is overseen by a self-selected
team of active contributors to the project. A Project Management Committee (PMC) guides the
Project's day-to-day operations, including community development and product releases. For
downloads, documentation, and ways to become involved with Apache Arrow, visit http://arrow.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source
projects, including Apache HTTP Server --the world's most popular Web server software. Through
the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members
and 7,000 Committers across six continents successfully collaborate to develop freely available
enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions
are distributed under the Apache License; and the community actively participates in ASF mailing
lists, mentoring initiatives, and ApacheCon, the Foundation's official global conference series.
The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate
sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget
Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks,
Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private
Internet Access, Red Hat, Target, Tencent, Union Investment, and Workday. For more information,
visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Arrow", "Apache Arrow", and "ApacheCon" are
registered trademarks or trademarks of the Apache Software Foundation in the United States
and/or other countries. All other brands and trademarks are the property of their respective
owners.

# # #

NOTE: you are receiving this message because you are subscribed to the announce@apache.org
distribution list. To unsubscribe, send email from the recipient account to announce-unsubscribe@apache.org
with the word "Unsubscribe" in the subject line.

Mime
View raw message