flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jark Wu (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-7001) Improve performance of Sliding Time Window with pane optimization
Date Sun, 25 Jun 2017 14:35:00 GMT
Jark Wu created FLINK-7001:
------------------------------

             Summary: Improve performance of Sliding Time Window with pane optimization
                 Key: FLINK-7001
                 URL: https://issues.apache.org/jira/browse/FLINK-7001
             Project: Flink
          Issue Type: Improvement
          Components: DataStream API
            Reporter: Jark Wu
            Assignee: Jark Wu
             Fix For: 1.4.0


Currently, the implementation of time-based sliding windows treats each window individually
and replicates records to each window. For a window of 10 minute size that slides by 1 second
the data is replicated 600 fold (10 minutes / 1 second). We can optimize sliding window by
divide windows into panes (aligned with slide), so that we can avoid record duplication and
leverage the checkpoint.

I will attach a more detail design doc to the issue.

The following issues are similar to this issue: FLINK-5387, FLINK-6990



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message