flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sourav Mazumder <sourav.mazumde...@gmail.com>
Subject What is the equivalent of Spark RDD is Flink
Date Thu, 24 Dec 2015 15:48:53 GMT
Hi,

I am new to Flink. Trying to understand some of the basics of Flink.

What is the equivalent of Spark's RDD in Flink ? In my understanding the
closes think is DataSet API. But wanted to reconfirm.

Also using DataSet API if I ingest a large volume of data (val lines :
DataSet[String] = env.readTextFile(<some file path and name>)), which may
not fit in single slave node, will that data get automatically distributed
in the memory of other slave nodes ?

Regards,
Sourav

Mime
View raw message