flink-user mailing list archives

From Dima Arbuzin <arbuzins...@gmail.com>
Subject Maintaining global variables. Best practices.
Date Wed, 10 Aug 2016 16:18:40 GMT
Hi there,

I'm working on a clustering algorithm for a DataStream, based on this paper
<http://www1.cse.wustl.edu/~ychen/public/sigproc-sp.pdf>. To implement it, I
need to maintain some global values (initially a HashMap, but possibly a
DataSet later) and update them on every arriving item.

Is this even possible with Flink (some kind of reverse broadcasting)?
If so, what would be the right way to do it, or what are the
alternatives?

To simulate a stream, I use a source function based on this demo:
<https://github.com/dataArtisans/flink-streaming-demo/blob/master/src/main/scala/com/dataartisans/flink_demo/sources/TaxiRideSource.scala>

Thanks,


-- 
Best regards,
Dima Arbuzin
