arrow-dev mailing list archives

From "Johan Mabille (JIRA)" <>
Subject [jira] [Commented] (ARROW-602) C++: Provide iterator access to primitive elements inside a Column/ChunkedArray
Date Tue, 21 Mar 2017 14:30:42 GMT


Johan Mabille commented on ARROW-602:

I had planned to add this interface to the numerical arrays first, but I agree it could be
hard to backtrack if we run into problems when extending the interface to the other array
classes in the future.

By the way, I guess your first question is addressing the handling of missing values? Sylvain
has done some work on it for xtensor; you might be interested in [].

More generally, what would be the best process for you? I can open a dedicated issue for this
topic, so that we can identify all the possible problems and start trying to solve them. Or
I can start a draft and we can iterate (or maybe you have already started something I can contribute
to). I guess this document could go into arrow/cpp/doc, so I may open a PR with the initial

> C++: Provide iterator access to primitive elements inside a Column/ChunkedArray
> -------------------------------------------------------------------------------
>                 Key: ARROW-602
>                 URL:
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Uwe L. Korn
>              Labels: beginner, newbie
> Given a ChunkedArray, an Arrow user must currently iterate over all its chunks and then
> cast them to their types to extract the primitive memory regions to access the values. A convenient
> way to access the underlying values would be to offer a function that takes a ChunkedArray
> and returns a C++ iterator over all elements.
> While this may not be the most performant way to access the underlying data, it should
> have sufficient performance and adds a convenience layer for new users.
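
To make the request in the issue description concrete, here is a minimal sketch of a forward iterator that walks every element across all chunks. It deliberately does not use the Arrow API: `ChunkedValues` and `sum_values` are hypothetical names, each chunk is modelled as a plain `std::vector<T>` standing in for a primitive array's value buffer, and missing values are not handled at all.

```cpp
#include <cassert>
#include <cstddef>
#include <iterator>
#include <numeric>
#include <utility>
#include <vector>

// Hypothetical stand-in for a chunked column; this is NOT Arrow's ChunkedArray.
// Each inner vector plays the role of one chunk's primitive values.
template <typename T>
class ChunkedValues {
public:
    using Chunk = std::vector<T>;

    explicit ChunkedValues(std::vector<Chunk> chunks) : chunks_(std::move(chunks)) {}

    class const_iterator {
    public:
        using iterator_category = std::forward_iterator_tag;
        using value_type = T;
        using difference_type = std::ptrdiff_t;
        using pointer = const T*;
        using reference = const T&;

        const_iterator() = default;
        const_iterator(const std::vector<Chunk>* chunks, std::size_t chunk, std::size_t offset)
            : chunks_(chunks), chunk_(chunk), offset_(offset) {
            skip_exhausted();
        }

        reference operator*() const {
            // Dereferencing the end iterator is a programming error.
            assert(chunks_ != nullptr && chunk_ < chunks_->size());
            return (*chunks_)[chunk_][offset_];
        }
        pointer operator->() const { return &**this; }

        const_iterator& operator++() {
            ++offset_;
            skip_exhausted();
            return *this;
        }
        const_iterator operator++(int) {
            const_iterator tmp = *this;
            ++*this;
            return tmp;
        }

        friend bool operator==(const const_iterator& a, const const_iterator& b) {
            return a.chunks_ == b.chunks_ && a.chunk_ == b.chunk_ && a.offset_ == b.offset_;
        }
        friend bool operator!=(const const_iterator& a, const const_iterator& b) {
            return !(a == b);
        }

    private:
        // Move past empty or fully consumed chunks so the iterator always
        // rests either on a valid element or exactly at the end position.
        void skip_exhausted() {
            while (chunk_ < chunks_->size() && offset_ >= (*chunks_)[chunk_].size()) {
                ++chunk_;
                offset_ = 0;
            }
        }

        const std::vector<Chunk>* chunks_ = nullptr;
        std::size_t chunk_ = 0;
        std::size_t offset_ = 0;
    };

    const_iterator begin() const { return const_iterator(&chunks_, 0, 0); }
    const_iterator end() const { return const_iterator(&chunks_, chunks_.size(), 0); }

private:
    std::vector<Chunk> chunks_;
};

// Example use: standard algorithms see one flat sequence of values.
template <typename T>
T sum_values(const ChunkedValues<T>& column) {
    return std::accumulate(column.begin(), column.end(), T{});
}
```

The iterator stores a (chunk index, offset) pair and normalises it after every increment, so callers never see chunk boundaries or empty chunks. A real implementation on top of Arrow would still need the per-chunk type cast described in the issue and a decision on how to expose missing values.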

This message was sent by Atlassian JIRA
