From issues-return-536163-archive-asf-public=cust-asf.ponee.io@flink.apache.org Fri Sep 10 11:30:03 2021 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mxout1-he-de.apache.org (mxout1-he-de.apache.org [95.216.194.37]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id 4BCA9180663 for ; Fri, 10 Sep 2021 13:30:03 +0200 (CEST) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-he-de.apache.org (ASF Mail Server at mxout1-he-de.apache.org) with SMTP id 1DB9D641C6 for ; Fri, 10 Sep 2021 11:30:01 +0000 (UTC) Received: (qmail 39186 invoked by uid 500); 10 Sep 2021 11:30:01 -0000 Mailing-List: contact issues-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list issues@flink.apache.org Received: (qmail 39172 invoked by uid 99); 10 Sep 2021 11:30:01 -0000 Received: from ec2-52-204-25-47.compute-1.amazonaws.com (HELO mailrelay1-ec2-va.apache.org) (52.204.25.47) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Sep 2021 11:30:01 +0000 Received: from jira2-he-de.apache.org (jira2-he-de.apache.org [168.119.33.54]) by mailrelay1-ec2-va.apache.org (ASF Mail Server at mailrelay1-ec2-va.apache.org) with ESMTPS id 34FD03E8DE for ; Fri, 10 Sep 2021 11:30:01 +0000 (UTC) Received: from jira2-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira2-he-de.apache.org (ASF Mail Server at jira2-he-de.apache.org) with ESMTP id 5B1BDC805CC for ; Fri, 10 Sep 2021 11:30:00 +0000 (UTC) Date: Fri, 10 Sep 2021 11:30:00 +0000 (UTC) From: =?utf-8?Q?Julio_P=C3=A9rez_=28Jira=29?= To: issues@flink.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (FLINK-24031) I am trying to deploy Flink in kubernetes but when I launch the taskManager in other container I get a Exception MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/FLINK-24031?page=3Dcom.atlassi= an.jira.plugin.system.issuetabpanels:all-tabpanel ] Julio P=C3=A9rez updated FLINK-24031: -------------------------------- Attachment: taskmanager.log > I am trying to deploy Flink in kubernetes but when I launch the taskManag= er in other container I get a Exception > -------------------------------------------------------------------------= --------------------------------------- > > Key: FLINK-24031 > URL: https://issues.apache.org/jira/browse/FLINK-24031 > Project: Flink > Issue Type: Bug > Components: Deployment / Kubernetes > Affects Versions: 1.13.0, 1.13.2 > Reporter: Julio P=C3=A9rez > Priority: Minor > Labels: pull-request-available > Fix For: 1.13.1 > > Attachments: flink-map.yml, jobmanager.log, jobmanager.yml, taskm= anager.log, taskmanager.yml > > > =C2=A0I explain here -> [https://github.com/apache/flink/pull/17020] > I have a problem when I try to run Flink in k8s with the follow manifests > I have the following exception > # JobManager : > {quote}2021-08-27 09:16:57,917 ERROR akka.remote.EndpointWriter [] - drop= ping message [class akka.actor.ActorSelectionMessage] for non-local recipie= nt [Actor[akka.tcp://flink@jobmanager-hs:6123/]] arriving at [akka.tcp://fl= ink@jobmanager-hs:6123] inbound addresses are [akka.tcp://flink@cluster:612= 3] > 2021-08-27 09:17:01,255 DEBUG org.apache.flink.runtime.resourcemanager.S= tandaloneResourceManager [] - Trigger heartbeat request. > 2021-08-27 09:17:01,284 DEBUG org.apache.flink.runtime.resourcemanager.S= tandaloneResourceManager [] - Trigger heartbeat request. > 2021-08-27 09:17:10,008 DEBUG akka.remote.transport.netty.NettyTransport= [] - Remote connection to [/172.17.0.1:34827] was disconnected because of = [id: 0x13ae1d03, /172.17.0.1:34827 :> /172.17.0.23:6123] DISCONNECTED > 2021-08-27 09:17:10,008 DEBUG akka.remote.transport.ProtocolStateActor [= ] - Association between local [tcp://flink@cluster:6123] and remote [tcp://= flink@172.17.0.1:34827] was disassociated because the ProtocolStateActor fa= iled: Unknown > 2021-08-27 09:17:10,009 WARN akka.remote.ReliableDeliverySupervisor [] -= Association with remote system [akka.tcp://flink@172.17.0.24:6122] has fai= led, address is now gated for [50] ms. Reason: [Disassociated] > {quote} > TaskManager: > {quote}INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could= not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123= /user/rpc/resourcemanager__, retrying in 10000 ms: Could not connect to rpc= endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/res= ourcemanager__. > INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not r= esolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/= rpc/resourcemanager__, retrying in 10000 ms: Could not connect to rpc endpo= int under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcem= anager__. > {quote} > Best regards, > Julio -- This message was sent by Atlassian Jira (v8.3.4#803005)