flink-issues mailing list archives

From "Fabian Tschirschnitz (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-1038) Adding a collection output format
Date Thu, 11 Sep 2014 12:22:33 GMT

    [ https://issues.apache.org/jira/browse/FLINK-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129949#comment-14129949 ]

Fabian Tschirschnitz commented on FLINK-1038:
---------------------------------------------

Applied the remarks from Stephan Ewen. Should be fine now?!

> Adding a collection output format
> ---------------------------------
>
>                 Key: FLINK-1038
>                 URL: https://issues.apache.org/jira/browse/FLINK-1038
>             Project: Flink
>          Issue Type: New Feature
>            Reporter: Sebastian Kruse
>            Priority: Minor
>
> Similar to the existing LocalCollectionOutputFormat or Spark's collect() method, it would
> be nice to have a CollectionOutputFormat that also works when running jobs on a cluster. This
> output format gathers all results of a sink from all TaskManagers in the JVM that submitted
> the job plan and provides them as a collection, similar to accumulators. This
> can help avoid the tedious task of going to HDFS and reading and parsing the individual result files.
> PS. We have already created such an output format and can contribute it.
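
For readers skimming the thread, here is a rough sketch of the idea described in the issue, not the contributed implementation. It uses only the JDK: threads stand in for parallel sink tasks on TaskManagers, and a shared list stands in for the collection handed back to the submitting JVM. CollectingSinkSketch, CollectingSink, writeRecord, getResults and runJob are hypothetical names chosen for illustration and are not Flink classes or methods. A real cluster version would also have to serialize records and ship them over the network, which is left out here.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class CollectingSinkSketch {

    /** Hypothetical stand-in for a collecting output format; not a Flink class. */
    static final class CollectingSink<T> {
        // Thread-safe list, because several sink tasks write concurrently.
        private final List<T> collected = Collections.synchronizedList(new ArrayList<>());

        void writeRecord(T record) {
            collected.add(record);
        }

        List<T> getResults() {
            // Copy under the list's lock so the caller gets a stable snapshot.
            synchronized (collected) {
                return new ArrayList<>(collected);
            }
        }
    }

    /** Simulates a job whose parallel sink tasks all report to one client-side collection. */
    static List<Integer> runJob(int parallelism, int recordsPerTask) {
        CollectingSink<Integer> sink = new CollectingSink<>();
        ExecutorService pool = Executors.newFixedThreadPool(parallelism);
        for (int task = 0; task < parallelism; task++) {
            final int taskId = task;
            pool.submit(() -> {
                // Each simulated task emits its own disjoint range of records.
                for (int i = 0; i < recordsPerTask; i++) {
                    sink.writeRecord(taskId * recordsPerTask + i);
                }
            });
        }
        pool.shutdown();
        try {
            if (!pool.awaitTermination(10, TimeUnit.SECONDS)) {
                throw new IllegalStateException("sink tasks did not finish in time");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for sink tasks", e);
        }
        // Arrival order across tasks is not deterministic, so sort before returning.
        List<Integer> results = sink.getResults();
        Collections.sort(results);
        return results;
    }

    public static void main(String[] args) {
        System.out.println(runJob(4, 3));
    }
}
```

The point of the sketch is only that the submitting program ends up with one in-memory collection, rather than a set of per-task result files that have to be read back and parsed.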



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
