flink-dev mailing list archives

From "Ziyad Muhammed Mohiyudheen (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-5888) ForwardedFields annotation is not generating optimised execution plan in example KMeans job
Date Wed, 22 Feb 2017 16:19:44 GMT
Ziyad Muhammed Mohiyudheen created FLINK-5888:
-------------------------------------------------

             Summary: ForwardedFields annotation is not generating optimised execution plan
in example KMeans job
                 Key: FLINK-5888
                 URL: https://issues.apache.org/jira/browse/FLINK-5888
             Project: Flink
          Issue Type: Bug
          Components: DataSet API, Examples, Java API
    Affects Versions: 1.1.3
            Reporter: Ziyad Muhammed Mohiyudheen


The Flink KMeans Java example [1] demonstrates the ForwardedFields function annotation.
However, the example job took longer than expected even on medium-sized data. Merely
removing the function annotation from the example code, without any other change, produced
a better execution plan and a shorter run time. With ForwardedFields enabled, the execution
plan shows that no combiner is used and that the two Map tasks are not chained. The
experiment is documented in [2].

[1] https://github.com/apache/flink/blob/master/flink-examples/flink-examples-batch/src/main/java/org/apache/flink/examples/java/clustering/KMeans.java
[2] https://drive.google.com/open?id=0B0IlZv0uHBuvVEZ5ZmNpN19jVVU
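
For readers unfamiliar with the annotation: ForwardedFields declares that a function copies
certain input fields unchanged to its output, and the optimizer may rely on that declaration
when it builds the plan. The sketch below is plain Java, not Flink code and not taken from
the KMeans example. Pair, Triple, appendCount, renumber and forwardsField0 are hypothetical
names chosen for illustration; the sketch only models the contract that a declaration such
as "field 0 is forwarded" would assert about a map function.

```java
import java.util.function.Function;

// Conceptual sketch only: models what "field 0 is forwarded" means.
// None of these types are Flink classes; all names are hypothetical.
final class ForwardedFieldSketch {

    // Stand-in for a two-field input record.
    static final class Pair {
        final int f0;
        final String f1;
        Pair(int f0, String f1) { this.f0 = f0; this.f1 = f1; }
    }

    // Stand-in for a three-field output record.
    static final class Triple {
        final int f0;
        final String f1;
        final long f2;
        Triple(int f0, String f1, long f2) { this.f0 = f0; this.f1 = f1; this.f2 = f2; }
    }

    // Copies fields 0 and 1 unchanged and appends a count of 1.
    // A function of this shape satisfies a "field 0 is forwarded" declaration.
    static Triple appendCount(Pair in) {
        return new Triple(in.f0, in.f1, 1L);
    }

    // Modifies field 0, so declaring field 0 as forwarded would be incorrect here.
    static Triple renumber(Pair in) {
        return new Triple(in.f0 + 1, in.f1, 1L);
    }

    // Checks, for one sample record, whether field 0 reaches the output unchanged.
    static boolean forwardsField0(Function<Pair, Triple> fn, Pair sample) {
        return fn.apply(sample).f0 == sample.f0;
    }

    public static void main(String[] args) {
        Pair sample = new Pair(3, "point");
        System.out.println(forwardsField0(ForwardedFieldSketch::appendCount, sample)); // prints true
        System.out.println(forwardsField0(ForwardedFieldSketch::renumber, sample));    // prints false
    }
}
```

A check on a single sample like this can only refute a declaration, not prove it. It is meant
to make the meaning of the annotation concrete, not to explain the plan difference reported above.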




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
