beam-commits mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Work logged] (BEAM-5520) Flink runner per operator SDK harness option
Date Mon, 01 Oct 2018 16:37:00 GMT

     [ https://issues.apache.org/jira/browse/BEAM-5520?focusedWorklogId=150081&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-150081 ]

ASF GitHub Bot logged work on BEAM-5520:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Oct/18 16:36
            Start Date: 01/Oct/18 16:36
    Worklog Time Spent: 10m 
      Work Description: tweise commented on a change in pull request #6524:  [BEAM-5520] Flink
pipeline option to run SDK harness per subtask.
URL: https://github.com/apache/beam/pull/6524#discussion_r221675858
 
 

 ##########
 File path: runners/flink/src/main/java/org/apache/beam/runners/flink/translation/functions/FlinkExecutableStageContext.java
 ##########
 @@ -35,8 +36,12 @@
     FlinkExecutableStageContext get(JobInfo jobInfo);
   }
 
-  static Factory factory() {
-    return FlinkDefaultExecutableStageContext.ReferenceCountingFactory.REFERENCE_COUNTING;
+  static Factory factory(FlinkPipelineOptions options) {
+    if ("[stage]".equals(options.getSdkWorkerParallelism())) {
 
 Review comment:
   done

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 150081)
    Time Spent: 2h  (was: 1h 50m)

> Flink runner per operator SDK harness option
> --------------------------------------------
>
>                 Key: BEAM-5520
>                 URL: https://issues.apache.org/jira/browse/BEAM-5520
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-flink
>            Reporter: Thomas Weise
>            Assignee: Thomas Weise
>            Priority: Major
>              Labels: portability, portability-flink
>          Time Spent: 2h
>  Remaining Estimate: 0h
>
> For streaming pipelines, the runner currently uses a single SDK harness process for all
> subtasks of a job that get deployed on the same task manager. In common deployments with 16
> or more task slots, many executable stage operators all share the same SDK harness process.
> To scale, we need an option to run separate harness processes per subtask.
>    
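The selection logic touched by the diff above can be illustrated with a small, self-contained sketch. It is not the Beam implementation: only the "[stage]" check on the sdkWorkerParallelism option comes from the quoted change. The types StageContext, SharedFactory and PerStageFactory, the String job id, and the "default" option value are hypothetical stand-ins, and the assumption is that any value other than "[stage]" keeps the shared, per-job behavior described in the issue.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

public class HarnessContextSketch {

  /** Hypothetical stand-in for a context that owns one SDK harness process. */
  static final class StageContext {
    private static final AtomicInteger NEXT_ID = new AtomicInteger();
    final int harnessId = NEXT_ID.incrementAndGet();
  }

  /** Hypothetical simplification of a factory keyed by job (a String id here). */
  interface Factory {
    StageContext get(String jobId);
  }

  /** Shared mode: all callers for the same job receive the same context. */
  static final class SharedFactory implements Factory {
    private final Map<String, StageContext> perJob = new ConcurrentHashMap<>();

    @Override
    public StageContext get(String jobId) {
      return perJob.computeIfAbsent(jobId, id -> new StageContext());
    }
  }

  /** Per-stage mode: every call creates a separate context. */
  static final class PerStageFactory implements Factory {
    @Override
    public StageContext get(String jobId) {
      return new StageContext();
    }
  }

  /** Chooses a factory from the option value; only "[stage]" is taken from the diff. */
  static Factory factory(String sdkWorkerParallelism) {
    if ("[stage]".equals(sdkWorkerParallelism)) {
      return new PerStageFactory();
    }
    // Assumption: every other value falls back to the shared behavior.
    return new SharedFactory();
  }

  public static void main(String[] args) {
    Factory shared = factory("default"); // "default" is a placeholder value
    Factory perStage = factory("[stage]");
    System.out.println("shared reuses context: "
        + (shared.get("job-1") == shared.get("job-1")));
    System.out.println("per-stage reuses context: "
        + (perStage.get("job-1") == perStage.get("job-1")));
  }
}
```

In this sketch the shared factory hands the same context to every operator of a job, which mirrors the single-harness behavior the issue describes, while the per-stage factory gives each caller its own context so that harness processes can scale with the number of subtasks.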



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
