From: Yeshwanth Sriram
Subject: [C++] - How to extract indices of nested MapArray
Date: Sun, 28 Feb 2021 09:05:33 -0800
To: user@arrow.apache.org

I am using C++/Arrow to filter large Parquet files, and I’m able to do this successfully. The current proof-of-concept implementation is based on nested for loops, which I would like to avoid. Instead, I would like to use the built-in filter/take functions, or get some recommendations on how to extract arrays of indices or booleans (via the take functions?) to filter out rows.

The input (data) array/column type is MapArray[key: String, value: StructArray[id: String, …]].

The input filter is {filter_key: “some string”, filter_ids: [“aaa”, “bee”, “see”, ..]}, where filter_key and filter_ids are to be matched against the contents of the input MapArray.

The output I’m looking for is either an array of booleans or an array of indices of the input array that match the input filter.

Thank you
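
To make the requested matching concrete, here is a minimal sketch of the semantics in plain C++. It does not use the Arrow API: `MapColumn`, `MatchRows`, and `MaskToIndices` are hypothetical names, and the `std::vector` members only stand in for the offsets, flattened keys, and flattened `id` field that a MapArray stores. It also assumes that a row matches when any of its map entries has a key equal to filter_key and an id contained in filter_ids; the message above does not spell this out.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for a MapArray[key: String, value: Struct[id: String, ...]]
// column. Row r owns the flattened entries [offsets[r], offsets[r + 1]).
struct MapColumn {
    std::vector<int32_t> offsets;    // num_rows + 1 entries
    std::vector<std::string> keys;   // flattened map keys
    std::vector<std::string> ids;    // flattened value.id, parallel to keys
};

// Returns one boolean per row. Assumed rule: a row matches if any of its
// entries has key == filter_key and id contained in filter_ids.
std::vector<bool> MatchRows(const MapColumn& col,
                            const std::string& filter_key,
                            const std::unordered_set<std::string>& filter_ids) {
    const std::size_t num_rows =
        col.offsets.empty() ? 0 : col.offsets.size() - 1;
    std::vector<bool> mask(num_rows, false);
    for (std::size_t row = 0; row < num_rows; ++row) {
        for (int32_t i = col.offsets[row]; i < col.offsets[row + 1]; ++i) {
            if (col.keys[i] == filter_key && filter_ids.count(col.ids[i]) > 0) {
                mask[row] = true;
                break;  // one matching entry is enough for this row
            }
        }
    }
    return mask;
}

// Converts the boolean mask into the indices of the matching rows.
std::vector<int64_t> MaskToIndices(const std::vector<bool>& mask) {
    std::vector<int64_t> indices;
    for (std::size_t row = 0; row < mask.size(); ++row) {
        if (mask[row]) {
            indices.push_back(static_cast<int64_t>(row));
        }
    }
    return indices;
}
```

The mask corresponds to the "array of booleans" output and the index vector to the "indices" output. This is only a reference for the expected result, against which a kernel-based version could be checked.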