hadoop-mapreduce-issues mailing list archives

From "Mahadev konar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-3054) Unable to kill submitted jobs
Date Mon, 26 Sep 2011 15:24:26 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114720#comment-13114720 ]

Mahadev konar commented on MAPREDUCE-3054:
------------------------------------------

I think most of the client commands should be fire-and-forget. Longer term, I think the NodeManager
should send a SIGTERM and then wait for a while before sending a SIGKILL, to let the AM and other
processes do some work before they are killed.
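
To make the SIGTERM-then-SIGKILL idea concrete, here is a minimal sketch in Python. It is not the NodeManager's code and not part of any attached patch; `stop_gracefully`, `spawn`, and the grace period are hypothetical names and values chosen for illustration, and the signal mapping of `terminate()`/`kill()` assumes a POSIX system.

```python
import signal
import subprocess
import sys


def stop_gracefully(proc: subprocess.Popen, grace_seconds: float = 5.0) -> int:
    """Send SIGTERM, wait up to grace_seconds, then fall back to SIGKILL.

    Returns the child's return code (the negated signal number on POSIX
    when the child was ended by a signal).
    """
    proc.terminate()  # SIGTERM on POSIX: the child may do some work and exit
    try:
        return proc.wait(timeout=grace_seconds)
    except subprocess.TimeoutExpired:
        proc.kill()  # SIGKILL on POSIX: cannot be caught or ignored
        return proc.wait()


def spawn(ignore_sigterm: bool) -> subprocess.Popen:
    """Start a hypothetical stand-in for a container process."""
    setup = "signal.signal(signal.SIGTERM, signal.SIG_IGN);" if ignore_sigterm else ""
    code = f"import signal, sys, time; {setup} print('ready', flush=True); time.sleep(60)"
    proc = subprocess.Popen([sys.executable, "-c", code], stdout=subprocess.PIPE)
    proc.stdout.readline()  # block until the child has installed its handler
    return proc


if __name__ == "__main__":
    # A well-behaved child exits on SIGTERM; a stubborn one needs SIGKILL.
    print(stop_gracefully(spawn(ignore_sigterm=False)))
    print(stop_gracefully(spawn(ignore_sigterm=True), grace_seconds=0.5))
```

The final `wait()` after `kill()` is what lets the caller confirm the process is really gone, which is the reason a forced kill has to remain as the last step.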

We always have to force-terminate; I am not sure how you can get away from that. We should
always send a kill signal to the RM, otherwise we cannot confirm that an AM is KILLED. How do you
intend to enforce that an AM is dead?




> Unable to kill submitted jobs
> -----------------------------
>
>                 Key: MAPREDUCE-3054
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3054
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.0
>            Reporter: Siddharth Seth
>            Assignee: Mahadev konar
>            Priority: Blocker
>             Fix For: 0.23.0
>
>         Attachments: MAPREDUCE-3054.patch, MAPREDUCE-3054.patch, MAPREDUCE-3054.patch, MAPREDUCE-3054.patch
>
>
> Found by Philip Su
> The "mapred job -kill" command appears to succeed, but listing the jobs again shows that
> the job supposedly killed is still there.
> {code}
> mapred job -list
> Total jobs:2
> JobId   State   StartTime       UserName        Queue   Priority        SchedulingInfo
> job_1316203984216_0002  PREP    1316204924937   hadoopqa        default NORMAL
> job_1316203984216_0001  PREP    1316204031206   hadoopqa        default NORMAL
> mapred job -kill job_1316203984216_0002
> Killed job job_1316203984216_0002
> mapred job -list
> Total jobs:2
> JobId   State   StartTime       UserName        Queue   Priority        SchedulingInfo
> job_1316203984216_0002  PREP    1316204924937   hadoopqa        default NORMAL
> job_1316203984216_0001  PREP    1316204031206   hadoopqa        default NORMAL
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
