flink-dev mailing list archives

From Felix Neutatz <neut...@googlemail.com>
Subject Should collect() and count() be treated as data sinks?
Date Thu, 02 Apr 2015 15:33:20 GMT

I have run the following program:

final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();

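List<Tuple1<Long>> l = Arrays.asList(new Tuple1<Long>(1L));
TypeInformation<Tuple1<Long>> t = TypeInfoParser.parse("Tuple1<Long>");
DataSet<Tuple1<Long>> data = env.fromCollection(l, t);

// count() is the only action in this program; no explicit sink is registered
long value = data.count();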


Since there is no "real" data sink, I get the following:
Exception in thread "main" java.lang.RuntimeException: No data sinks have
been created yet. A program needs at least one sink that consumes data.
Examples are writing the data set or printing it.

In my opinion, count() and collect() should be treated as data sinks, just as print() is.
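
To make the proposal concrete, here is a rough, self-contained sketch of the behaviour I have in mind. It is a toy model, not Flink code: ToyEnvironment, ToyDataSet and registerSink are hypothetical names, and I am only assuming that an action could register an implicit sink before it triggers execution.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Toy model only: these classes are hypothetical and are NOT Flink's API.
class ToyEnvironment {
    private final List<Runnable> sinks = new ArrayList<>();

    <T> ToyDataSet<T> fromCollection(List<T> elements) {
        return new ToyDataSet<>(this, elements);
    }

    void registerSink(Runnable sink) {
        sinks.add(sink);
    }

    // Rejects a plan without sinks, similar in spirit to the error quoted above.
    void execute() {
        if (sinks.isEmpty()) {
            throw new RuntimeException("No data sinks have been created yet.");
        }
        for (Runnable sink : sinks) {
            sink.run();
        }
        sinks.clear();
    }
}

class ToyDataSet<T> {
    private final ToyEnvironment env;
    private final List<T> elements;

    ToyDataSet(ToyEnvironment env, List<T> elements) {
        this.env = env;
        this.elements = elements;
    }

    // Proposed behaviour: count() registers its own sink, then executes.
    long count() {
        long[] result = new long[1];
        env.registerSink(() -> result[0] = elements.size());
        env.execute();
        return result[0];
    }

    // Proposed behaviour: collect() does the same and returns the elements.
    List<T> collect() {
        List<T> result = new ArrayList<>();
        env.registerSink(() -> result.addAll(elements));
        env.execute();
        return result;
    }
}

class ToyDemo {
    public static void main(String[] args) {
        ToyEnvironment env = new ToyEnvironment();
        ToyDataSet<Long> data = env.fromCollection(Arrays.asList(1L));
        System.out.println(data.count()); // prints 1
    }
}
```

With this model, the program above would run without any explicit sink, while calling execute() on an environment with no sinks at all would still fail.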

What do you think?

Best regards,

