flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Márton Balassi <balassi.mar...@gmail.com>
Subject Re: [GSoc][flink-streaming] Interested in pursuing FLINK-1617 and FLINK-1534
Date Mon, 16 Mar 2015 09:52:08 GMT
Dear Akshay,

Thanks again for your interest and for the recent contribution to streaming.

Both of the projects mentioned wold be largely appreciated by the
community, and you can also propose other project suggestions here for
discussion.

Regarding FLINK-1534, the thesis I mentioned serves as a starting point and
indeed the basic solution can be implemented with filtering and
windowing/mapping with some state storing whether the cause of an event has
been already seen. Solely relying on the now existing windowing API this
however might cause performance issues if the events also have an
expiration timeout - some optimization there would be included. The further
challenge is to try to further exploit the parallel job execution of Flink
to possibly scale a pattern matching query.

Best,

Marton

On Sun, Mar 15, 2015 at 3:22 PM, Akshay Dixit <akshaydixi@gmail.com> wrote:

> Hi,
> I'm Akshay Dixit[1], a 4th year undergrad at VIT Vellore, India. I'm
> currently interested in distributed systems and stream processing and am
> looking to delve deeper into the subject, and hope to get some insight by
> contributing to Apache Flink. I've gathered some idea of the
> flink-streaming codebase by recently working on a PR for FLINK-1450[2].
>
> Both FLINK-1617[3] and FLINK-1534[4] are interesting projects that I would
> love to work on over the summer. I was wondering which amongst these would
> be more appreciated by the community, so I can start working towards a
> proposal for either one.
>
> Regarding FLINK-1534, I was wondering why would simply merging and
> filtering the existing streams for events we want to detect not work? Also
> on going through the document mentioned by @mbalassi in the JIRA
> comment[5], the authors specify some Runtime Event Detection concepts in
> Section 5.2. I'm assuming the project entails on building a similar analogy
> using Flink and the deliverables would include working pattern matching
> operators over Flink DataStreams as described in the report. If so, then
> shouldn't it be trivial to implement the described the Binary operator
> using a WindowedStream and a Filter?
> I hope my questions don't seem misplaced here and I would appreciate links
> to literature where I can learn more on the topic.
>
> Regards,
> Akshay Dixit
>
> [1] : http://akshaydixi.me
> [2] : https://github.com/apache/flink/pull/481
> [3] : https://issues.apache.org/jira/browse/FLINK-1617
> [4] : https://issues.apache.org/jira/browse/FLINK-1534
> [5] :
> http://www.doc.ic.ac.uk/teaching/distinguished-projects/2012/k.nagy.pdf
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message