arrow-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Antoine Pitrou <anto...@python.org>
Subject Re: How to know what pyarrow expect to works with '.gz' file
Date Mon, 29 Mar 2021 13:54:25 GMT

First, does the file decompress successfully (using e.g. gunzip)?



On Mon, 29 Mar 2021 13:56:02 +0200
jonathan mercier <jonathan.mercier@cnrgh.fr> wrote:
> Dear,
> 
> I failed to read a gzip compressed tabular file:
> 
> ```bash
> $ file a_file.gz: gzip compressed data, extra field
> ```
> 
> ```python
> (pdb) read_csv(a_file)
> *** OSError: Truncated compressed stream
> ```
> 
> 
> So to my understanding the problem de not come from reader or parser
> option but from th decompression.
> 
> So how can I get the reason pyarrow failed to read a such file ?
> 
> thanks
> 
> Best regards 
> 




Mime
View raw message