beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jingsong Lee (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (BEAM-1393) Update Flink Runner to Flink 1.2.0
Date Fri, 10 Feb 2017 09:41:41 GMT

    [ https://issues.apache.org/jira/browse/BEAM-1393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15860992#comment-15860992
] 

Jingsong Lee commented on BEAM-1393:
------------------------------------

Good point! The processing of pushed-back events is indeed a trouble. For non-keyed operators,
we store the elements in SPLIT_DISTRIBUTE state, this is no problem. But for keyed operators,
we can't find the prepared events when a new side-input element come if we use {{KeyedStateBackend}}.
We need to find all the pushed-back events that have the side-input window. Just like the
processing of timer.
Maybe we need override {{AbstractStreamOperator.snapshotState}} to store pushed-back events
by KeyGroups way with snapshot TimerService. I see that only one {{startNewKeyGroup}} can
be called, so we have to override the TimerService snapshot instead of calling super.

> Update Flink Runner to Flink 1.2.0
> ----------------------------------
>
>                 Key: BEAM-1393
>                 URL: https://issues.apache.org/jira/browse/BEAM-1393
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-flink
>            Reporter: Aljoscha Krettek
>            Assignee: Jingsong Lee
>
> When we update to 1.2.0 we can use the new internal Timer API that is available to Flink
operators: {{InternalTimerService}} and also use broadcast state to store side-input data.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message