flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-9545) Support read a file multiple times in Flink DataStream
Date Fri, 15 Jun 2018 08:57:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-9545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16513553#comment-16513553

ASF GitHub Bot commented on FLINK-9545:

Github user kl0u commented on the issue:

    Hi @bowenli86, 
    Me, @zentol and @aljoscha both seem to have doubts about the utility of the feature.
    So given this, and to have a clean JIRA and list of PRs we have to work on, I would suggest

    to close the PR and the related issue.

> Support read a file multiple times in Flink DataStream 
> -------------------------------------------------------
>                 Key: FLINK-9545
>                 URL: https://issues.apache.org/jira/browse/FLINK-9545
>             Project: Flink
>          Issue Type: Improvement
>          Components: DataStream API
>    Affects Versions: 1.6.0
>            Reporter: Bowen Li
>            Assignee: Bowen Li
>            Priority: Major
>             Fix For: 1.6.0
> Motivation: We have the requirements to read a bunch files, each file to read multiple
times, to feed our streams
> Specifically we need {{StreamExecutionEnvironment.readFile/readTextFile}} to be able
to read a file for a specified {{N}} times, but currently it only supports reading file once.
> We've implemented this internally. Would be good to get it back to the community version.
This jira is to add support for the feature. 
> Plan:
> add a new processing mode as PROCESSING_N_TIMES, and add additional parameter {{numTimes}}
for {{StreamExecutionEnvironment.readFile/readTextFile}}

This message was sent by Atlassian JIRA

View raw message