arrow-dev mailing list archives

From Wes McKinney <wesmck...@gmail.com>
Subject Re: Getting started on board report... input?
Date Thu, 12 Jan 2017 14:11:26 GMT
+1, thanks Jacques

On Thu, Jan 12, 2017 at 1:40 AM, Jacques Nadeau <jacques@apache.org> wrote:
> Thanks for the ideas. Here is what I posted:
>
> ## Description:
> Arrow is a columnar in-memory analytics layer designed to accelerate big
> data.
> It houses a set of canonical in-memory representations of flat and
> hierarchical data along with multiple language-bindings for structure
> manipulation. It also provides IPC and common algorithm implementations.
>
> ## Issues:
> - There are no issues requiring board attention at this time.
>
> ## Activity:
> - Arrow has made great progress since the last report.
> - The community has actively been driving towards a set of cross-language
>   compatibility tests. These are now complete.
> - The compatibility tests were identified as a key gate to considering the
>   specification solidified. Now that they are complete, the community will
>   be starting work on our second release.
> - This release will show the Arrow projects (Java Arrow, arrow-cpp and
>   py-arrow) and Parquet's parquet-cpp all working nicely together.
>
> ## Health report:
> - A core group of community members continues to make good progress on
>   various aspects of the Java, C++ and Python projects.
> - We've seen a small number of casual contributors arrive and provide
>   additional patches to the project.
> - Multiple people have been doing community outreach through various blog
>   posts, meetups and conference presentations. Examples include:
>   - Upcoming talk at Strata San Jose in March
>   - Upcoming talk at Dataworks Summit Munich in April
>   - Arrow and Pandas vision: https://s.apache.org/arrow_1701_01
>   - Python Data Wrangling talk: https://s.apache.org/arrow_1701_02
> - We continue to see nice growth in mailing list and JIRA activity.
>
> ## PMC changes:
>
> - Currently 18 PMC members.
> - Wes McKinney was added to the PMC on Wed Oct 19 2016
>
> ## Committer base changes:
>
> - Currently 21 committers.
> - Uwe Korn was added as a committer on Thu Oct 27 2016
>
> ## Releases:
>
> - Last release was 0.1.0 on Tue Oct 11 2016
>
> ## JIRA activity:
>
> - 140 JIRA tickets created in the last 3 months
> - 117 JIRA tickets closed/resolved in the last 3 months
>
> On Mon, Jan 9, 2017 at 10:16 AM, Wes McKinney <wesmckinn@gmail.com> wrote:
>
>> The integration test success is the biggest news. We should definitely
>> try to release 0.2 as soon as possible.
>>
>> Other features
>>
>> - Significantly improved performance in Python pandas interoperability
>> - Refactored parquet-cpp to depend on Arrow C++ memory management
>> (buffers) and IO components
>>
>> I wrote 3 blog posts recently that include some words about Arrow
>>
>> http://wesmckinney.com/blog/outlook-for-2017/
>> http://wesmckinney.com/blog/high-perf-arrow-to-pandas/
>> http://wesmckinney.com/blog/python-hdfs-interfaces/
>>
>> I talked some about Arrow in my recent talk:
>> http://www.slideshare.net/wesm/python-data-wrangling-preparing-for-the-future.
>>
>> Thanks
>> Wes
>>
>> On Mon, Jan 9, 2017 at 12:45 PM, Julien Le Dem <julien@dremio.com> wrote:
>> > Now that we have cross-compatibility tests between Java and C++, we're
>> > going to release a 0.2 soon.
>> > arrow-cpp, py-arrow and parquet-cpp work together nicely.
>> > I'm going to talk about Arrow at Strata San Jose (March) and Dataworks
>> > Summit in Munich (April).
>> >
>> >
>> > On Mon, Jan 9, 2017 at 9:38 AM, Jacques Nadeau <jacques@apache.org> wrote:
>> >
>> >> Hey All, I'm going to pull together an Arrow board report. Anything
>> >> people think I should note? Presentations, progress, key features, etc.?
>> >>
>> >> thanks,
>> >> jacques
>> >>
>> >
>> >
>> >
>> > --
>> > Julien
>>
