Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id DA3A8200B8C for ; Tue, 23 Aug 2016 03:24:23 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id D9250160ABC; Tue, 23 Aug 2016 01:24:23 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3016B160ABD for ; Tue, 23 Aug 2016 03:24:23 +0200 (CEST) Received: (qmail 21118 invoked by uid 500); 23 Aug 2016 01:24:22 -0000 Mailing-List: contact issues-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list issues@flink.apache.org Received: (qmail 20893 invoked by uid 99); 23 Aug 2016 01:24:22 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 Aug 2016 01:24:22 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 32EE02C0151 for ; Tue, 23 Aug 2016 01:24:22 +0000 (UTC) Date: Tue, 23 Aug 2016 01:24:22 +0000 (UTC) From: "ASF GitHub Bot (JIRA)" To: issues@flink.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (FLINK-4348) Implement communication from ResourceManager to TaskManager MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 23 Aug 2016 01:24:24 -0000 [ https://issues.apache.org/jira/browse/FLINK-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15431935#comment-15431935 ] ASF GitHub Bot commented on FLINK-4348: --------------------------------------- Github user beyond1920 commented on a diff in the pull request: https://github.com/apache/flink/pull/2389#discussion_r75789477 --- Diff: flink-runtime/src/main/java/org/apache/flink/runtime/rpc/resourcemanager/ResourceManager.java --- @@ -52,18 +61,45 @@ public class ResourceManager extends RpcEndpoint { private final ExecutionContext executionContext; private final Map jobMasterGateways; + private final Map taskExecutorGateways; + private final Map heartbeatSchedulers; + private final LeaderElectionService leaderElectionService; + private UUID leaderSessionID; + // TODO private final SlotManager slotManager; --- End diff -- SlotManager is doing by kurt. So I marked some TODO here waiting for his PR. It would be removed after SlotManager code committed > Implement communication from ResourceManager to TaskManager > ----------------------------------------------------------- > > Key: FLINK-4348 > URL: https://issues.apache.org/jira/browse/FLINK-4348 > Project: Flink > Issue Type: Sub-task > Components: Cluster Management > Reporter: Kurt Young > Assignee: zhangjing > > There are mainly 3 logics initiated from RM to TM: > * Heartbeat, RM use heartbeat to sync with TM's slot status > * SlotRequest, when RM decides to assign slot to JM, should first try to send request to TM for slot. TM can either accept or reject this request. > * FailureNotify, in some corner cases, TM will be marked as invalid by cluster manager master(e.g. yarn master), but TM itself does not realize. RM should send failure notify to TM and TM can terminate itself -- This message was sent by Atlassian JIRA (v6.3.4#6332)