From user-return-1153-archive-asf-public=cust-asf.ponee.io@arrow.apache.org Wed Mar 31 18:00:04 2021 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mxout1-he-de.apache.org (mxout1-he-de.apache.org [95.216.194.37]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id 1E853180674 for ; Wed, 31 Mar 2021 20:00:04 +0200 (CEST) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-he-de.apache.org (ASF Mail Server at mxout1-he-de.apache.org) with SMTP id 045E86043D for ; Wed, 31 Mar 2021 18:00:02 +0000 (UTC) Received: (qmail 47577 invoked by uid 500); 31 Mar 2021 17:59:58 -0000 Mailing-List: contact user-help@arrow.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@arrow.apache.org Delivered-To: mailing list user@arrow.apache.org Received: (qmail 47567 invoked by uid 99); 31 Mar 2021 17:59:58 -0000 Received: from spamproc1-he-de.apache.org (HELO spamproc1-he-de.apache.org) (116.203.196.100) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Mar 2021 17:59:58 +0000 Received: from localhost (localhost [127.0.0.1]) by spamproc1-he-de.apache.org (ASF Mail Server at spamproc1-he-de.apache.org) with ESMTP id E32721FF39C for ; Wed, 31 Mar 2021 17:59:57 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamproc1-he-de.apache.org X-Spam-Flag: NO X-Spam-Score: 0.248 X-Spam-Level: X-Spam-Status: No, score=0.248 tagged_above=-999 required=6.31 tests=[HEADER_FROM_DIFFERENT_DOMAINS=0.249, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-ec2-va.apache.org ([116.203.227.195]) by localhost (spamproc1-he-de.apache.org [116.203.196.100]) (amavisd-new, port 10024) with ESMTP id jIHnnmreLa00 for ; Wed, 31 Mar 2021 17:59:57 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=116.202.254.214; helo=ciao.gmane.io; envelope-from=gcaau-arrow-user@m.gmane-mx.org; receiver= Received: from ciao.gmane.io (ciao.gmane.io [116.202.254.214]) by mx1-ec2-va.apache.org (ASF Mail Server at mx1-ec2-va.apache.org) with ESMTPS id 244E2BD8D7 for ; Wed, 31 Mar 2021 17:59:57 +0000 (UTC) Received: from list by ciao.gmane.io with local (Exim 4.92) (envelope-from ) id 1lRf89-0003aF-Bz for user@arrow.apache.org; Wed, 31 Mar 2021 19:59:49 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: user@arrow.apache.org From: Antoine Pitrou Subject: Re: How to know what pyarrow expect to works with '.gz' file Date: Wed, 31 Mar 2021 19:59:44 +0200 Message-ID: <20210331195944.201fa686@fsol> References: <20210329155425.16382ed7@fsol> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Newsreader: Claws Mail 3.17.5 (GTK+ 2.24.32; x86_64-pc-linux-gnu) On Mon, 29 Mar 2021 16:09:33 +0200 jonathan mercier wrote: > Le lundi 29 mars 2021 =C3=A0 15:54 +0200, Antoine Pitrou a =C3=A9crit=C2= =A0: > >=20 > > First, does the file decompress successfully (using e.g. gunzip)? > > =20 >=20 > Yes, And I=C2=A0can use pyarrow on this gunziped file through read_csv > method without error. >=20 > Pandas is able to read this compressed file Ok, thank you for giving me access to the compressed file. I have filed this as https://issues.apache.org/jira/browse/ARROW-12169 (with PR attached). Regards Antoine.