flink-issues mailing list archives

From bowenli86 <...@git.apache.org>
Subject [GitHub] flink issue #5342: [FLINK-8479] Timebounded stream join
Date Fri, 26 Jan 2018 21:46:46 GMT
Github user bowenli86 commented on the issue:

    https://github.com/apache/flink/pull/5342
  
    Very interesting! Two things:
    1. Can you make the Google doc publicly viewable? I cannot access it right now.
    2. How does it handle event-time window joins of two streams where the data in one stream
always arrives much later than the data in the other? For example, say we are joining streams A and B
on 10-minute event-time tumbling windows (12:00 - 12:10, 12:10 - 12:20, ...), and data in stream B
always arrives 30 minutes later than the data in stream A. How does the operator handle that? Does it
cache A's data until B's data arrives, do the join, and then remove A's data from the cache? (I haven't
read the code in detail; I'm just trying to get a general idea of the overall design.)
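
    To make question 2 concrete, here is a toy sketch of the behavior being asked about. It is
not taken from the PR and does not use any Flink API: the class `BufferedWindowJoin` and its
methods are hypothetical, timestamps are minutes after 12:00, and it assumes the operator tracks
one watermark per input and only fires a window once the smaller of the two has passed the window end.

```python
from collections import defaultdict

WINDOW_MIN = 10  # tumbling window size in minutes (from the example above)


class BufferedWindowJoin:
    """Hypothetical toy model of a buffered event-time tumbling-window join."""

    def __init__(self, window=WINDOW_MIN):
        self.window = window
        # window start -> elements buffered per input side
        self.buffers = defaultdict(lambda: {"A": [], "B": []})
        # latest watermark seen on each input
        self.watermarks = {"A": float("-inf"), "B": float("-inf")}

    def window_start(self, ts):
        return ts - ts % self.window

    def on_element(self, side, key, ts, value):
        # Cache the element under the window its event time falls into.
        self.buffers[self.window_start(ts)][side].append((key, value))

    def on_watermark(self, side, ts):
        # The combined watermark is held back by the slower input.
        self.watermarks[side] = max(self.watermarks[side], ts)
        combined = min(self.watermarks.values())
        results = []
        for start in sorted(self.buffers):
            if start + self.window <= combined:
                buf = self.buffers.pop(start)  # join, then drop the cached state
                for key_a, val_a in buf["A"]:
                    for key_b, val_b in buf["B"]:
                        if key_a == key_b:
                            results.append((start, key_a, val_a, val_b))
        return results


join = BufferedWindowJoin()
join.on_element("A", "k", 3, "a1")       # A's 12:03 record arrives first
early = join.on_watermark("A", 40)       # A is far ahead; nothing fires yet
join.on_element("B", "k", 5, "b1")       # B's 12:05 record shows up much later
joined = join.on_watermark("B", 10)      # B passes 12:10; the window fires
```

    In this model A's record stays cached until B's watermark catches up, so the amount of
buffered state grows with the lag between the two streams. Whether the PR behaves this way is
exactly the open question.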

