From issues-return-453632-archive-asf-public=cust-asf.ponee.io@flink.apache.org Thu Mar 11 08:23:07 2021 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mxout1-he-de.apache.org (mxout1-he-de.apache.org [95.216.194.37]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id 3F33418062C for ; Thu, 11 Mar 2021 09:23:07 +0100 (CET) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-he-de.apache.org (ASF Mail Server at mxout1-he-de.apache.org) with SMTP id 902CD63651 for ; Thu, 11 Mar 2021 08:23:06 +0000 (UTC) Received: (qmail 67034 invoked by uid 500); 11 Mar 2021 08:23:02 -0000 Mailing-List: contact issues-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list issues@flink.apache.org Received: (qmail 67024 invoked by uid 99); 11 Mar 2021 08:23:02 -0000 Received: from mailrelay1-he-de.apache.org (HELO mailrelay1-he-de.apache.org) (116.203.21.61) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Mar 2021 08:23:02 +0000 Received: from jira2-he-de.apache.org (jira2-he-de.apache.org [168.119.33.54]) by mailrelay1-he-de.apache.org (ASF Mail Server at mailrelay1-he-de.apache.org) with ESMTPS id D1C683E847 for ; Thu, 11 Mar 2021 08:23:00 +0000 (UTC) Received: from jira2-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira2-he-de.apache.org (ASF Mail Server at jira2-he-de.apache.org) with ESMTP id 1A607C801AB for ; Thu, 11 Mar 2021 08:23:00 +0000 (UTC) Date: Thu, 11 Mar 2021 08:23:00 +0000 (UTC) From: "Matthias (Jira)" To: issues@flink.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (FLINK-21597) testMapAfterRepartitionHasCorrectParallelism2 Fail because of "NoResourceAvailableException" MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/FLINK-21597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299400#comment-17299400 ] Matthias commented on FLINK-21597: ---------------------------------- Ok, thanks for your insights. I'm gonna have another look as well. > testMapAfterRepartitionHasCorrectParallelism2 Fail because of "NoResourceAvailableException" > --------------------------------------------------------------------------------------------- > > Key: FLINK-21597 > URL: https://issues.apache.org/jira/browse/FLINK-21597 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.13.0 > Reporter: Guowei Ma > Priority: Major > Labels: test-stability > > {code:java} > 2021-03-04T00:17:41.2017402Z [ERROR] testMapAfterRepartitionHasCorrectParallelism2[Execution mode = CLUSTER](org.apache.flink.api.scala.operators.PartitionITCase) Time elapsed: 300.117 s <<< ERROR! > 2021-03-04T00:17:41.2018058Z org.apache.flink.runtime.client.JobExecutionException: Job execution failed. > 2021-03-04T00:17:41.2018525Z at org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:144) > 2021-03-04T00:17:41.2019563Z at org.apache.flink.runtime.minicluster.MiniClusterJobClient.lambda$getJobExecutionResult$3(MiniClusterJobClient.java:137) > 2021-03-04T00:17:41.2020129Z at java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:616) > 2021-03-04T00:17:41.2021974Z at java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:591) > 2021-03-04T00:17:41.2022634Z at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:488) > 2021-03-04T00:17:41.2023118Z at java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:1975) > 2021-03-04T00:17:41.2023682Z at org.apache.flink.runtime.rpc.akka.AkkaInvocationHandler.lambda$invokeRpc$0(AkkaInvocationHandler.java:237) > 2021-03-04T00:17:41.2024244Z at java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:774) > 2021-03-04T00:17:41.2024749Z at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:750) > 2021-03-04T00:17:41.2025261Z at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:488) > 2021-03-04T00:17:41.2026070Z at java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:1975) > 2021-03-04T00:17:41.2026814Z at org.apache.flink.runtime.concurrent.FutureUtils$1.onComplete(FutureUtils.java:1066) > 2021-03-04T00:17:41.2027633Z at akka.dispatch.OnComplete.internal(Future.scala:264) > 2021-03-04T00:17:41.2028245Z at akka.dispatch.OnComplete.internal(Future.scala:261) > 2021-03-04T00:17:41.2028796Z at akka.dispatch.japi$CallbackBridge.apply(Future.scala:191) > 2021-03-04T00:17:41.2029327Z at akka.dispatch.japi$CallbackBridge.apply(Future.scala:188) > 2021-03-04T00:17:41.2030017Z at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:36) > 2021-03-04T00:17:41.2030795Z at org.apache.flink.runtime.concurrent.Executors$DirectExecutionContext.execute(Executors.java:73) > 2021-03-04T00:17:41.2031885Z at scala.concurrent.impl.CallbackRunnable.executeWithValue(Promise.scala:44) > 2021-03-04T00:17:41.2032678Z at scala.concurrent.impl.Promise$DefaultPromise.tryComplete(Promise.scala:252) > 2021-03-04T00:17:41.2033428Z at akka.pattern.PromiseActorRef.$bang(AskSupport.scala:572) > 2021-03-04T00:17:41.2034197Z at akka.pattern.PipeToSupport$PipeableFuture$$anonfun$pipeTo$1.applyOrElse(PipeToSupport.scala:22) > 2021-03-04T00:17:41.2035094Z at akka.pattern.PipeToSupport$PipeableFuture$$anonfun$pipeTo$1.applyOrElse(PipeToSupport.scala:21) > 2021-03-04T00:17:41.2035915Z at scala.concurrent.Future$$anonfun$andThen$1.apply(Future.scala:436) > 2021-03-04T00:17:41.2036617Z at scala.concurrent.Future$$anonfun$andThen$1.apply(Future.scala:435) > 2021-03-04T00:17:41.2037537Z at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:36) > 2021-03-04T00:17:41.2038019Z at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55) > 2021-03-04T00:17:41.2038554Z at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:91) > 2021-03-04T00:17:41.2039117Z at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91) > 2021-03-04T00:17:41.2039671Z at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91) > 2021-03-04T00:17:41.2040159Z at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72) > 2021-03-04T00:17:41.2040632Z at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:90) > 2021-03-04T00:17:41.2041086Z at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:40) > 2021-03-04T00:17:41.2041810Z at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:44) > 2021-03-04T00:17:41.2042514Z at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) > 2021-03-04T00:17:41.2042977Z at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) > 2021-03-04T00:17:41.2043425Z at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) > 2021-03-04T00:17:41.2043887Z at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) > 2021-03-04T00:17:41.2044399Z Caused by: org.apache.flink.runtime.JobException: Recovery is suppressed by NoRestartBackoffTimeStrategy > 2021-03-04T00:17:41.2044991Z at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.handleFailure(ExecutionFailureHandler.java:130) > 2021-03-04T00:17:41.2045695Z at org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.getFailureHandlingResult(ExecutionFailureHandler.java:81) > 2021-03-04T00:17:41.2046343Z at org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskFailure(DefaultScheduler.java:221) > 2021-03-04T00:17:41.2047000Z at org.apache.flink.runtime.scheduler.DefaultScheduler.maybeHandleTaskFailure(DefaultScheduler.java:212) > 2021-03-04T00:17:41.2047579Z at org.apache.flink.runtime.scheduler.DefaultScheduler.updateTaskExecutionStateInternal(DefaultScheduler.java:203) > 2021-03-04T00:17:41.2048171Z at org.apache.flink.runtime.scheduler.SchedulerBase.updateTaskExecutionState(SchedulerBase.java:696) > 2021-03-04T00:17:41.2049092Z at org.apache.flink.runtime.scheduler.UpdateSchedulerNgOnInternalFailuresListener.notifyTaskFailure(UpdateSchedulerNgOnInternalFailuresListener.java:51) > 2021-03-04T00:17:41.2049893Z at org.apache.flink.runtime.executiongraph.ExecutionGraph.notifySchedulerNgAboutInternalTaskFailure(ExecutionGraph.java:1470) > 2021-03-04T00:17:41.2050492Z at org.apache.flink.runtime.executiongraph.Execution.processFail(Execution.java:1111) > 2021-03-04T00:17:41.2050989Z at org.apache.flink.runtime.executiongraph.Execution.processFail(Execution.java:1051) > 2021-03-04T00:17:41.2051474Z at org.apache.flink.runtime.executiongraph.Execution.markFailed(Execution.java:885) > 2021-03-04T00:17:41.2052211Z at org.apache.flink.runtime.executiongraph.ExecutionVertex.markFailed(ExecutionVertex.java:661) > 2021-03-04T00:17:41.2052877Z at org.apache.flink.runtime.scheduler.DefaultExecutionVertexOperations.markFailed(DefaultExecutionVertexOperations.java:41) > 2021-03-04T00:17:41.2053654Z at org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskDeploymentFailure(DefaultScheduler.java:505) > 2021-03-04T00:17:41.2054285Z at org.apache.flink.runtime.scheduler.DefaultScheduler.lambda$assignResourceOrHandleError$6(DefaultScheduler.java:490) > 2021-03-04T00:17:41.2054838Z at java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:836) > 2021-03-04T00:17:41.2055323Z at java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:811) > 2021-03-04T00:17:41.2055805Z at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:488) > 2021-03-04T00:17:41.2056318Z at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1990) > 2021-03-04T00:17:41.2056943Z at org.apache.flink.runtime.scheduler.SharedSlot.cancelLogicalSlotRequest(SharedSlot.java:222) > 2021-03-04T00:17:41.2057554Z at org.apache.flink.runtime.scheduler.SlotSharingExecutionSlotAllocator.cancelLogicalSlotRequest(SlotSharingExecutionSlotAllocator.java:164) > 2021-03-04T00:17:41.2058220Z at org.apache.flink.runtime.scheduler.SharingPhysicalSlotRequestBulk.cancel(SharingPhysicalSlotRequestBulk.java:86) > 2021-03-04T00:17:41.2058875Z at org.apache.flink.runtime.jobmaster.slotpool.PhysicalSlotRequestBulkWithTimestamp.cancel(PhysicalSlotRequestBulkWithTimestamp.java:66) > 2021-03-04T00:17:41.2059642Z at org.apache.flink.runtime.jobmaster.slotpool.PhysicalSlotRequestBulkCheckerImpl.lambda$schedulePendingRequestBulkWithTimestampCheck$0(PhysicalSlotRequestBulkCheckerImpl.java:91) > 2021-03-04T00:17:41.2060319Z at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) > 2021-03-04T00:17:41.2060938Z at java.util.concurrent.FutureTask.run(FutureTask.java:266) > 2021-03-04T00:17:41.2061472Z at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRunAsync(AkkaRpcActor.java:440) > 2021-03-04T00:17:41.2062265Z at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:208) > 2021-03-04T00:17:41.2062824Z at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:77) > 2021-03-04T00:17:41.2063375Z at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:158) > 2021-03-04T00:17:41.2063821Z at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26) > 2021-03-04T00:17:41.2064246Z at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21) > 2021-03-04T00:17:41.2064669Z at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123) > 2021-03-04T00:17:41.2065093Z at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21) > 2021-03-04T00:17:41.2065537Z at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170) > 2021-03-04T00:17:41.2065975Z at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) > 2021-03-04T00:17:41.2066390Z at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) > 2021-03-04T00:17:41.2066798Z at akka.actor.Actor$class.aroundReceive(Actor.scala:517) > 2021-03-04T00:17:41.2067249Z at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225) > 2021-03-04T00:17:41.2067916Z at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592) > 2021-03-04T00:17:41.2068415Z at akka.actor.ActorCell.invoke(ActorCell.scala:561) > 2021-03-04T00:17:41.2068785Z at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258) > 2021-03-04T00:17:41.2069166Z at akka.dispatch.Mailbox.run(Mailbox.scala:225) > 2021-03-04T00:17:41.2069523Z at akka.dispatch.Mailbox.exec(Mailbox.scala:235) > 2021-03-04T00:17:41.2069784Z ... 4 more > 2021-03-04T00:17:41.2070383Z Caused by: java.util.concurrent.CompletionException: org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: Slot request bulk is not fulfillable! Could not allocate the required slot within slot request timeout > 2021-03-04T00:17:41.2071162Z at java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:292) > 2021-03-04T00:17:41.2071905Z at java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:308) > 2021-03-04T00:17:41.2072420Z at java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:607) > 2021-03-04T00:17:41.2073089Z at java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:591) > 2021-03-04T00:17:41.2073462Z ... 31 more > 2021-03-04T00:17:41.2073977Z Caused by: org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: Slot request bulk is not fulfillable! Could not allocate the required slot within slot request timeout > 2021-03-04T00:17:41.2074838Z at org.apache.flink.runtime.jobmaster.slotpool.PhysicalSlotRequestBulkCheckerImpl.lambda$schedulePendingRequestBulkWithTimestampCheck$0(PhysicalSlotRequestBulkCheckerImpl.java:86) > 2021-03-04T00:17:41.2075386Z ... 24 more > 2021-03-04T00:17:41.2075706Z Caused by: java.util.concurrent.TimeoutException: Timeout has occurred: 300000 ms > 2021-03-04T00:17:41.2076048Z ... 25 more > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)