From: Wes McKinney
Date: Mon, 20 May 2019 13:45:26 -0500
Subject: Re: [C++] Storing/retreiving a Table in plasma
To: user@arrow.apache.org

hi Miki,

In
https://github.com/353solutions/carrow/blob/plasma/_misc/plasma.cc#L47
GetRecordBatchSize does not represent the entire size of the stream including the schema.

If you are serializing the Schema separately from the RecordBatch, then you need to use the lower-level arrow::ipc::ReadRecordBatch/WriteRecordBatch functions. Have a look at the unit tests.

If you are going to use RecordBatchStreamWriter, then you need to compute the size using MockOutputStream, per my original e-mail.

- Wes

On Mon, May 20, 2019 at 12:50 PM Miki Tebeka wrote:
>> That link didn't work for me.
>
> Doh! I moved it to https://github.com/353solutions/carrow/blob/plasma/_misc/plasma.cc
>
>> Would it not be better to do this work in Apache Arrow rather than an external project? I would guess the community would be interested in this.
>
> I do plan to suggest this as a patch to arrow once the code is usable; currently it's just noise.
>
> The idea behind carrow is to use the underlying C++ both in Python & Go so that in the same process we can simply share pointers (and maybe later use a shared memory allocator to do it between processes). I don't see a clear path to do it with the current Go implementation since it uses the Go runtime to allocate memory, and carrow has a complicated build process that currently won't work with a simple "go get".
>
> To get initial usable Go<->Python IPC quickly, I'm trying to utilize plasma for now. However, in the long run I'd like to just share pointers with no serialization at all.
>
> I'd love to discuss how we can make this project usable and get the community's help in solving some "ease of build" issues later on. Would love to have it in the main arrow eventually.
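
To illustrate the sizing advice in the reply above (write the whole stream once to a sink that only counts bytes, then allocate exactly that many bytes and write again), here is a minimal, self-contained C++ sketch. It deliberately does not use Arrow: Sink, CountingSink, FixedBufferSink, WriteBlock, WriteStream, StreamSize and SerializeExact are hypothetical names, and the length-prefixed layout is a toy stand-in for the real IPC stream format. In Arrow itself the counting role would be played by arrow::io::MockOutputStream, as the reply suggests.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical toy types; none of this is Arrow's API.
using Batch = std::vector<std::int32_t>;

struct Sink {
  virtual ~Sink() = default;
  virtual void Write(const void* data, std::size_t n) = 0;
};

// Counts bytes without storing them (the "mock output stream" idea).
class CountingSink : public Sink {
 public:
  void Write(const void* /*data*/, std::size_t n) override { size_ += n; }
  std::size_t size() const { return size_; }

 private:
  std::size_t size_ = 0;
};

// Writes into a caller-provided buffer of fixed capacity,
// such as a preallocated shared-memory object.
class FixedBufferSink : public Sink {
 public:
  FixedBufferSink(std::uint8_t* data, std::size_t capacity)
      : data_(data), capacity_(capacity) {}

  void Write(const void* data, std::size_t n) override {
    if (n > capacity_ - pos_) throw std::length_error("buffer too small");
    if (n > 0) std::memcpy(data_ + pos_, data, n);
    pos_ += n;
  }
  std::size_t position() const { return pos_; }

 private:
  std::uint8_t* data_;
  std::size_t capacity_;
  std::size_t pos_ = 0;
};

// Toy framing: an 8-byte length prefix followed by the payload.
void WriteBlock(Sink& sink, const void* data, std::size_t n) {
  const std::uint64_t len = n;
  sink.Write(&len, sizeof(len));
  sink.Write(data, n);
}

// The full stream is the schema block followed by one block per batch.
void WriteStream(Sink& sink, const std::string& schema,
                 const std::vector<Batch>& batches) {
  WriteBlock(sink, schema.data(), schema.size());
  for (const Batch& b : batches) {
    WriteBlock(sink, b.data(), b.size() * sizeof(std::int32_t));
  }
}

// Pass 1: run the real writer against the counting sink.
std::size_t StreamSize(const std::string& schema,
                       const std::vector<Batch>& batches) {
  CountingSink counter;
  WriteStream(counter, schema, batches);
  return counter.size();
}

// Pass 2: allocate exactly the counted size and write for real.
std::vector<std::uint8_t> SerializeExact(const std::string& schema,
                                         const std::vector<Batch>& batches) {
  std::vector<std::uint8_t> buffer(StreamSize(schema, batches));
  FixedBufferSink sink(buffer.data(), buffer.size());
  WriteStream(sink, schema, batches);
  assert(sink.position() == buffer.size());
  return buffer;
}
```

With the schema string "x:int32" and the batches {1, 2, 3} and {4, 5}, StreamSize reports 51 bytes, while the two batch payloads alone add up to only 20 bytes. A buffer sized from the payloads alone would therefore be too small for the full stream, which is the same kind of mismatch the reply points out for GetRecordBatchSize.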