flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-9838) Slot request failed Exceptions after completing a job
Date Mon, 23 Jul 2018 08:05:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-9838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16552459#comment-16552459
] 

ASF GitHub Bot commented on FLINK-9838:
---------------------------------------

Github user bowenli86 commented on the issue:

    https://github.com/apache/flink/pull/6373
  
    Ran into this bug too.
    
    +1 for the fix


> Slot request failed Exceptions after completing a job
> -----------------------------------------------------
>
>                 Key: FLINK-9838
>                 URL: https://issues.apache.org/jira/browse/FLINK-9838
>             Project: Flink
>          Issue Type: Bug
>          Components: Distributed Coordination
>    Affects Versions: 1.5.1, 1.6.0
>            Reporter: Nico Kruber
>            Assignee: Till Rohrmann
>            Priority: Major
>              Labels: pull-request-available
>
> Currently, after a job finished, e.g. the following one, several exceptions are logged
(at INFO level) about failed slot requests although the job has run successfully.
> {code}
> StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
> env.fromElements(1, 2, 3, 4).print();
> env.execute();
> {code}
> {code}
> 16:28:16,106 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager 
- Closing the SlotManager.
> 16:28:16,106 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager 
- Suspending the SlotManager.
> 16:28:16,106 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager 
- Unregister TaskManager aa20e76adb9aee0cdadc50dbc06ea208 from the SlotManager.
> 16:28:16,107 INFO  org.apache.flink.runtime.resourcemanager.StandaloneResourceManager
 - Slot request with allocation id f99ff6d66f7bc618a9ee6e9470e0cdb1 for job 1bdaafd1072e210790790b99e7741b6a
failed.
> org.apache.flink.util.FlinkException: The assigned slot b21f8807-5d0a-4e53-9e55-b6522b4a41c0_0
was removed.
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.removeSlot(SlotManager.java:786)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.removeSlots(SlotManager.java:756)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.internalUnregisterTaskManager(SlotManager.java:948)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.unregisterTaskManager(SlotManager.java:372)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.suspend(SlotManager.java:234)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.close(SlotManager.java:251)
> 	at org.apache.flink.runtime.resourcemanager.ResourceManager.postStop(ResourceManager.java:224)
> 	at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.postStop(AkkaRpcActor.java:105)
> 	at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.postStop(FencedAkkaRpcActor.java:40)
> 	at akka.actor.Actor$class.aroundPostStop(Actor.scala:515)
> 	at akka.actor.UntypedActor.aroundPostStop(UntypedActor.scala:95)
> 	at akka.actor.dungeon.FaultHandling$class.akka$actor$dungeon$FaultHandling$$finishTerminate(FaultHandling.scala:210)
> 	at akka.actor.dungeon.FaultHandling$class.terminate(FaultHandling.scala:172)
> 	at akka.actor.ActorCell.terminate(ActorCell.scala:374)
> 	at akka.actor.ActorCell.invokeAll$1(ActorCell.scala:467)
> 	at akka.actor.ActorCell.systemInvoke(ActorCell.scala:483)
> 	at akka.dispatch.Mailbox.processAllSystemMessages(Mailbox.scala:282)
> 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:260)
> 	at akka.dispatch.Mailbox.run(Mailbox.scala:224)
> 	at akka.dispatch.Mailbox.exec(Mailbox.scala:234)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 16:28:16,109 INFO  org.apache.flink.runtime.taskexecutor.TaskExecutor            - Stopping
TaskExecutor akka://flink/user/taskmanager_0.
> 16:28:16,110 INFO  org.apache.flink.runtime.state.TaskExecutorLocalStateStoresManager
 - Shutting down TaskExecutorLocalStateStoresManager.
> 16:28:16,109 INFO  org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Stopping
dispatcher akka://flink/user/dispatcher421f3c27-5248-40d4-b219-f0c23480bd6f.
> 16:28:16,111 INFO  org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Stopping
all currently running jobs of dispatcher akka://flink/user/dispatcher421f3c27-5248-40d4-b219-f0c23480bd6f.
> ...
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message