From dev-return-11102-archive-asf-public=cust-asf.ponee.io@arrow.apache.org Sat Mar 16 23:55:17 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 49BC1180648 for ; Sun, 17 Mar 2019 00:55:17 +0100 (CET) Received: (qmail 23972 invoked by uid 500); 16 Mar 2019 23:55:16 -0000 Mailing-List: contact dev-help@arrow.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@arrow.apache.org Delivered-To: mailing list dev@arrow.apache.org Received: (qmail 23958 invoked by uid 99); 16 Mar 2019 23:55:15 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Mar 2019 23:55:15 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id CFAF7183098 for ; Sat, 16 Mar 2019 23:55:14 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.048 X-Spam-Level: ** X-Spam-Status: No, score=2.048 tagged_above=-999 required=6.31 tests=[DKIMWL_WL_MED=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id JtszKOiyZUla for ; Sat, 16 Mar 2019 23:55:13 +0000 (UTC) Received: from mail-io1-f51.google.com (mail-io1-f51.google.com [209.85.166.51]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id A001A5F3CC for ; Sat, 16 Mar 2019 23:55:13 +0000 (UTC) Received: by mail-io1-f51.google.com with SMTP id x7so11362870ioh.4 for ; Sat, 16 Mar 2019 16:55:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to; bh=+B5MZ09GJr+FR6J8v8xCzf1MTNGc5Y+dUPTt5h4Z6H4=; b=WMupKtBh78MCycDsYFA71IDTb9j790WB9/KZMQwQqLyKK2aNoMUyuQ4BLUP3r3ZrNj 00YgUAHsxYz4Y3sfdlairO69Uhcus0EtTp4IUR+rjK4QSqI1F40rcuU94JB+JPv61G9f 1bFGXqdxZ/zw5HrBMkAVTtSVtpgFbMwijW9k0EqAvN+pXiuTMSzinaNPhXqvfTFnIi0r wb3ivSKUPfQWT3tGJgNCyesFHqOlFI+w7vXLuCUlmVvBQ+D7wBInIaxaxxa8RAIwwzSd oEPzbhJqgR5IlJYOGjJ3a68oByoWzz9yBVMjyaqsxvG/LLZpFNYSFR4AZCiddiXbZeKh Ixfg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=+B5MZ09GJr+FR6J8v8xCzf1MTNGc5Y+dUPTt5h4Z6H4=; b=cKNt4JVhoaWTFFvo4eovGp9OxlPFBfuoLYjKms0H5Wf3yOkpQFK0H/2dY/bOz2B7hb Zl/8+F0Zqfwf3hT2dY2pj24gf2i+oCBjKYEeanWq0UfyeebwDyyQW1XtY+xe+gks6p2e 52/V+pHRh+47+Ag+pOwknDKFHd+mVXk2K/bNG1VCoWVWDnW6tax9+qXI1XbIW9XW9Pyw EgMwUk3k0iFzU5NNI9nCC2QkoWT5uCsPWHdZDQ2tBstfEpcntsv1prnproNHV3APOYM+ 9L/rHGLC4SCIdoW7SObregmV97uvkLcCViN0US1z4WuFePnpHR4W0HIuztVmIT4DPtJD 4AtQ== X-Gm-Message-State: APjAAAU9jcTk+/g7YZkcUCwFeqrX5QCKzi9+FvAUWhxH7fEQX17DzLSI TZcTEpjsFW7Bko3VlaOM7J0hzzPTYrWLgv/WOoyiZw== X-Google-Smtp-Source: APXvYqzkt/whCBMkl1MOAVE6PQn9F7qqDZfRqOtWlDs9NSP7WORgoI1BQghIFRZ4VGIEjoIafpIEFrDvhcFrZCKz1CQ= X-Received: by 2002:a6b:4419:: with SMTP id r25mr6456845ioa.12.1552780512753; Sat, 16 Mar 2019 16:55:12 -0700 (PDT) MIME-Version: 1.0 From: Andy Grove Date: Sat, 16 Mar 2019 17:55:01 -0600 Message-ID: Subject: Parquet test files with all data types To: dev@arrow.apache.org Content-Type: multipart/alternative; boundary="000000000000e07b5105843ee2e7" --000000000000e07b5105843ee2e7 Content-Type: text/plain; charset="UTF-8" Hi, The currently available Parquet files in the Arrow repository don't seem to be sufficient for unit testing support for all date/time data types. For example the "alltypes_plain.parquet" only has one time column and it uses the deprecated INT96 encoding. I guess I'm volunteering to try and create a Parquet test file that covers all types needed, probably using the Java implementation since that would be easiest for me. Before I do this though, am I missing something? It seems odd that we don't already have this? Thanks, Andy. --000000000000e07b5105843ee2e7--