From: Игорь Ястребов
Reply-To: user@arrow.apache.org
Subject: Pyarrow: best way to store scheme
Date: Fri, 09 Aug 2019 09:47:08 -0000

Hi everyone!

Is there a recommended way to store schemata for Arrow tables on disk? I want to load them later and pass them to the CSV reader (by constructing a dictionary of column types, or by passing the schema directly if that gets implemented in the future).

This is necessary for reading multiple CSV files that come from the same source, where an individual file may get wrongly inferred types because it lacks data (null fields, or integer types inferred instead of float types).