beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (BEAM-3099) Implement HDFS FileSystem for Python SDK
Date Wed, 24 Jan 2018 00:46:00 GMT


ASF GitHub Bot commented on BEAM-3099:

udim opened a new pull request #4471: [BEAM-3099] Split out BufferedReader and BufferedWriter
from gcsio.
   Most of the code in is copied verbatim from
   The Downloader and Uploader classes are new.

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

> Implement HDFS FileSystem for Python SDK
> ----------------------------------------
>                 Key: BEAM-3099
>                 URL:
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-py-core
>            Reporter: Chamikara Jayalath
>            Assignee: Udi Meiri
>            Priority: Major
> Currently Java SDK has HDFS support but Python SDK does not. With current portability
efforts other runners may soon be able to use Python SDK. Having HDFS support will allow these
runners to execute large scale jobs without using GCS. 
> Following suggests some libraries that can be used to connect to HDFS from Python.

This message was sent by Atlassian JIRA

View raw message