From: Bjoern Bachmann
To: user@arrow.apache.org
Subject: [C++] Data Movement using Apache Arrow
Date: Wed, 19 May 2021 19:35:57 +0200

Hi Apache Arrow User,

I'm looking for some help to get started with Apache Arrow. We have a distributed system which has the following operation sequence:

Compute Node 1: Captures measurement data as two float arrays (32-bit, 150k values per array), which are transferred to a Kafka cluster.

Compute Node 2: Should listen on Kafka for new data logs (the data logs are microbatches of measurement data), fetch the data when it arrives, and then pass it to another process/thread where the data is analysed.
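
For reference, here is a rough estimate of the raw payload per microbatch, as a minimal sketch. The helper name `raw_batch_bytes` is purely illustrative, and the figure ignores any framing or metadata that Kafka or Arrow would add on top:

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper: raw size in bytes of one microbatch made of
// `arrays` float arrays with `values_per_array` values each.
// Assumes `float` is the 32-bit type mentioned above; no headers or padding.
inline std::size_t raw_batch_bytes(std::size_t arrays,
                                   std::size_t values_per_array) {
    static_assert(sizeof(float) == 4, "this sketch assumes a 32-bit float");
    assert(arrays > 0 && values_per_array > 0);
    return arrays * values_per_array * sizeof(float);
}
```

With the numbers above, `raw_batch_bytes(2, 150000)` gives 1,200,000 bytes, so each microbatch is about 1.2 MB of raw float data.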

The system is mainly written in C++, with parts of the final data analysis in C#. How could I effectively use Arrow in such a scenario? Or should I use an alternative solution? Are there any examples or code snippets that could help?

Thanks!