flink-dev mailing list archives

From "Gyula Fora (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-2261) Remove reduce/aggregation from DataStream
Date Mon, 22 Jun 2015 18:12:00 GMT
Gyula Fora created FLINK-2261:

             Summary: Remove reduce/aggregation from DataStream
                 Key: FLINK-2261
                 URL: https://issues.apache.org/jira/browse/FLINK-2261
             Project: Flink
          Issue Type: Improvement
          Components: Java API, Scala API, Streaming
    Affects Versions: 0.10
            Reporter: Gyula Fora
            Assignee: Gyula Fora

Currently, reduce and aggregation methods are also available on non-grouped DataStreams,
where they produce local aggregates that depend on the parallelism of the operator.

This behaviour is neither intuitive nor useful, as it only produces sensible results if the
user explicitly sets the parallelism to 1, which should not be encouraged.

I would like to remove these methods from the DataStream API and keep them only for GroupedDataStream
and WindowedDataStream, where the aggregation is executed either per key or per window.
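To illustrate the problem described above, here is a minimal sketch in plain Python. It does not use Flink at all. The function names `non_grouped_sum` and `grouped_sum` are hypothetical, and the round-robin distribution of elements across parallel instances is an assumption made for illustration only; the actual partitioning in Flink may differ.

```python
from collections import defaultdict


def non_grouped_sum(values, parallelism):
    """Simulate a running sum on a non-grouped stream.

    Assumption: elements are spread round-robin over `parallelism`
    operator instances, and each instance keeps its own running total.
    """
    totals = [0] * parallelism
    emitted = []
    for i, value in enumerate(values):
        instance = i % parallelism       # which parallel instance gets the element
        totals[instance] += value        # local aggregate only, no global state
        emitted.append(totals[instance])
    return emitted


def grouped_sum(records):
    """Simulate a running sum on a grouped stream: one total per key."""
    totals = defaultdict(int)
    emitted = []
    for key, value in records:
        totals[key] += value             # the same key always hits the same total
        emitted.append((key, totals[key]))
    return emitted


if __name__ == "__main__":
    values = [1, 2, 3, 4]
    # Parallelism 1: a true global running sum.
    print(non_grouped_sum(values, 1))
    # Parallelism 2: two interleaved local sums, not a global one.
    print(non_grouped_sum(values, 2))
    # Grouped: the result is defined per key, independent of parallelism.
    print(grouped_sum([("a", 1), ("b", 2), ("a", 3), ("b", 4)]))
```

In this sketch, the non-grouped result changes with the parallelism value, while the grouped result is determined by the keys alone, which is the distinction the proposal relies on.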

This message was sent by Atlassian JIRA
