hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vinod Kumar Vavilapalli (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-3054) Unable to kill submitted jobs
Date Mon, 26 Sep 2011 16:11:26 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114749#comment-13114749
] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3054:
----------------------------------------------------

That's a little different. Cleaning up of containers(or AM)' process tree is independent and
can be triggered once it is clear for the components in question that they are done with their
work - via TaskUmbilicalProtocol.done() for the task and AMRMProtocol.unregister() for the
AM case.

For the killed jobs, once AM gets the kill-signal, it can gracefully shut down by writing
history-file, sending notification url etc, then it can proceed to deregistering with the
RM. Once the deregistration comes through, the RM can then direct the NM to make sure the
AM is dead.

> Unable to kill submitted jobs
> -----------------------------
>
>                 Key: MAPREDUCE-3054
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3054
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.0
>            Reporter: Siddharth Seth
>            Assignee: Mahadev konar
>            Priority: Blocker
>             Fix For: 0.23.0
>
>         Attachments: MAPREDUCE-3054.patch, MAPREDUCE-3054.patch, MAPREDUCE-3054.patch,
MAPREDUCE-3054.patch
>
>
> Found by Philip Su
> The "mapred job -kill" command
> appears to succeed, but listing the jobs again shows that the job supposedly killed is
still there. 
> {code}
> mapred job -list
> Total jobs:2
> JobId   State   StartTime       UserName        Queue   Priority        SchedulingInfo
> job_1316203984216_0002  PREP    1316204924937   hadoopqa        default NORMAL
> job_1316203984216_0001  PREP    1316204031206   hadoopqa        default NORMAL
> mapred job -kill job_1316203984216_0002
> Killed job job_1316203984216_0002
> mapred job -list
> Total jobs:2
> JobId   State   StartTime       UserName        Queue   Priority        SchedulingInfo
> job_1316203984216_0002  PREP    1316204924937   hadoopqa        default NORMAL
> job_1316203984216_0001  PREP    1316204031206   hadoopqa        default NORMAL
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message