flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gyula Fora (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-3065) Can't cancel failing jobs
Date Tue, 24 Nov 2015 09:16:11 GMT

    [ https://issues.apache.org/jira/browse/FLINK-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024064#comment-15024064

Gyula Fora commented on FLINK-3065:

A failing job that gets stuck (because one of the tasks deadlocks in cancelling ) 

Concrete case:
Flink fails with a kafka issue, deadlocks while failing in zookeeper. I cannot "kill" the
job from the webfrontend or the command line client.

> Can't cancel failing jobs
> -------------------------
>                 Key: FLINK-3065
>                 URL: https://issues.apache.org/jira/browse/FLINK-3065
>             Project: Flink
>          Issue Type: Bug
>          Components: Command-line client, Webfrontend
>    Affects Versions: 0.10.0, 1.0.0
>            Reporter: Gyula Fora
>            Priority: Blocker
> It is currently not possible to stop a failing streaming job (if it get's stuck while
failing for instance).
> There is no cancel button in the web interface, also it doesnt show on the list of running
jobs in the command line.
> This means jobs getting stuck while failing will take down the cluster eventually.

This message was sent by Atlassian JIRA

View raw message