From user-return-1139-archive-asf-public=cust-asf.ponee.io@arrow.apache.org Mon Mar 29 14:09:45 2021 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mxout1-ec2-va.apache.org (mxout1-ec2-va.apache.org [3.227.148.255]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id 727BB180670 for ; Mon, 29 Mar 2021 16:09:45 +0200 (CEST) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-ec2-va.apache.org (ASF Mail Server at mxout1-ec2-va.apache.org) with SMTP id B0EDF3EF1B for ; Mon, 29 Mar 2021 14:09:44 +0000 (UTC) Received: (qmail 8268 invoked by uid 500); 29 Mar 2021 14:09:44 -0000 Mailing-List: contact user-help@arrow.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@arrow.apache.org Delivered-To: mailing list user@arrow.apache.org Received: (qmail 8252 invoked by uid 99); 29 Mar 2021 14:09:43 -0000 Received: from spamproc1-he-fi.apache.org (HELO spamproc1-he-fi.apache.org) (95.217.134.168) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Mar 2021 14:09:43 +0000 Received: from localhost (localhost [127.0.0.1]) by spamproc1-he-fi.apache.org (ASF Mail Server at spamproc1-he-fi.apache.org) with ESMTP id 0852DC02D5 for ; Mon, 29 Mar 2021 14:09:42 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamproc1-he-fi.apache.org X-Spam-Flag: NO X-Spam-Score: -0.001 X-Spam-Level: X-Spam-Status: No, score=-0.001 tagged_above=-999 required=6.31 tests=[SPF_PASS=-0.001] autolearn=disabled Received: from mx1-ec2-va.apache.org ([116.203.227.195]) by localhost (spamproc1-he-fi.apache.org [95.217.134.168]) (amavisd-new, port 10024) with ESMTP id 5yTR4wX2OSdi for ; Mon, 29 Mar 2021 14:09:41 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=193.50.0.66; helo=sirona.cnrgh.fr; envelope-from=jonathan.mercier@cnrgh.fr; receiver= Received: from sirona.cnrgh.fr (sirona.cnrgh.fr [193.50.0.66]) by mx1-ec2-va.apache.org (ASF Mail Server at mx1-ec2-va.apache.org) with ESMTP id 0B11DBD0E7 for ; Mon, 29 Mar 2021 14:09:40 +0000 (UTC) Received: from [192.168.1.77] (129.203.39.62.rev.sfr.net [62.39.203.129]) (Authenticated sender: jmercier) by sirona.cnrgh.fr (Postfix) with ESMTPSA id B3EF9DF3E4 for ; Mon, 29 Mar 2021 16:09:33 +0200 (CEST) Message-ID: Subject: Re: How to know what pyarrow expect to works with '.gz' file From: jonathan mercier To: user@arrow.apache.org Date: Mon, 29 Mar 2021 16:09:33 +0200 In-Reply-To: <20210329155425.16382ed7@fsol> References: <20210329155425.16382ed7@fsol> Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.38.4 (3.38.4-1.fc33) MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Le lundi 29 mars 2021 à 15:54 +0200, Antoine Pitrou a écrit : > > First, does the file decompress successfully (using e.g. gunzip)? > Yes, And I can use pyarrow on this gunziped file through read_csv method without error. Pandas is able to read this compressed file --                 Researcher computational biology                 PhD, Jonathan MERCIER                              Bioinformatics (LBI)                 2, rue Gaston                 Crémieux                 91057 Evry Cedex                                           Tel :(+33)1 60 87 83 44                 Email :jonathan.mercier@cnrgh.fr