beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Luke Cwik (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (BEAM-76) FileBasedSource should take a list of files/globs
Date Fri, 04 Mar 2016 19:48:40 GMT

    [ https://issues.apache.org/jira/browse/BEAM-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15180435#comment-15180435
] 

Luke Cwik commented on BEAM-76:
-------------------------------

What if instead of a FileBasedSource taking a list of files/globs, we support a generic concat
source that allows you to join multiple custom sources.

This would allow mixed mode sources in addition to implicitly supporting multiple files/globs
by concatenating multiple single file/glob FileBasedSources

> FileBasedSource should take a list of files/globs
> -------------------------------------------------
>
>                 Key: BEAM-76
>                 URL: https://issues.apache.org/jira/browse/BEAM-76
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-java-core
>            Reporter: Daniel Halperin
>
> FileBasedSource takes a [single glob|https://github.com/GoogleCloudPlatform/DataflowJavaSDK/blob/master/sdk/src/main/java/com/google/cloud/dataflow/sdk/io/FileBasedSource.java#L96].
If the user wants to read from an explicit file list or multiple globs they have to use multiple
sources and Flatten. This seems like an oversight.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message