arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yacko (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ARROW-1213) can we add support for pyarrow to read from s3 based on partitions
Date Thu, 13 Jul 2017 14:01:00 GMT
Yacko created ARROW-1213:
----------------------------

             Summary: can we add support for pyarrow to read from s3 based on partitions
                 Key: ARROW-1213
                 URL: https://issues.apache.org/jira/browse/ARROW-1213
             Project: Apache Arrow
          Issue Type: Improvement
            Reporter: Yacko
            Priority: Minor


Pyarrow dataset function can't read from s3 using s3fs as the filesystem. Is  there a way
we can add the support for read from s3 based on partitioned files ?

I am trying to address the problem mentioned in the stackoverflow link :
https://stackoverflow.com/questions/45082832/how-to-read-partitioned-parquet-files-from-s3-using-pyarrow-in-python



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message