hadoop-common-issues mailing list archives

From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-8650) /bin/hadoop-daemon.sh to add "-f <timeout>" arg for forced shutdowns
Date Fri, 03 Aug 2012 22:07:02 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-8650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13428411#comment-13428411 ]

Steve Loughran commented on HADOOP-8650:

In HA environments, and other situations, you may want to forcibly shut down a Hadoop service,
even if it is hung. Currently, hadoop-daemon.sh sends a normal SIGTERM signal, one that the
process has to pick up and react to.

If the process is completely hung, the signal may never be acted on, so the process stays up.
The only way to deal with this is to wait a while, find the pid and kill -9 it.
This must be done by hand, or in an external script. The latter is brittle to changes
in HADOOP_PID_DIR values, and requires everyone writing such scripts to code and test them themselves.

To replicate this: 
 # start a daemon: {{hadoop-daemon.sh start namenode}}
 # issue a {{kill -STOP <pid>}} to its PID
 # try to stop the daemon via the {{hadoop-daemon.sh stop namenode}} command.
 # observe that the NN process remains present.

We could extend hadoop-daemon.sh to support a "-f <timeout>" argument, which gives the process
a timeout within which it must terminate; if it is still running after that, a kill -9 is issued.
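
A minimal sketch of that escalation logic, under stated assumptions: the function name {{stop_with_timeout}}, its arguments (a pid file path and a timeout in seconds) and its return codes are hypothetical, and none of this is taken from hadoop-daemon.sh itself. It sends SIGTERM, polls for the process to exit, and falls back to {{kill -9}} once the timeout has passed.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of hadoop-daemon.sh: stop the process named in
# a pid file, escalating to SIGKILL if it is still present after a timeout.
# Returns 0 if the process went away after SIGTERM, 1 if SIGKILL was needed,
# 2 if the pid file was missing or empty.
stop_with_timeout() {
  local pidfile=$1 timeout=$2 pid waited=0
  [ -s "$pidfile" ] || return 2
  pid=$(cat "$pidfile")

  # Polite request first: plain kill sends SIGTERM.
  kill "$pid" 2>/dev/null

  # kill -0 delivers no signal; it only checks that the pid still exists.
  while kill -0 "$pid" 2>/dev/null; do
    if [ "$waited" -ge "$timeout" ]; then
      kill -9 "$pid" 2>/dev/null   # forced shutdown
      rm -f "$pidfile"
      return 1
    fi
    sleep 1
    waited=$((waited + 1))
  done
  rm -f "$pidfile"
  return 0
}
```

A "-f <timeout>" option could then hand the daemon's pid file and the timeout to such a function, for example {{stop_with_timeout "$pid_file" 30}}, where {{pid_file}} stands for whatever path the script derives from HADOOP_PID_DIR. Keeping the pid lookup inside the daemon script is what removes the brittleness of external scripts.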
> /bin/hadoop-daemon.sh to add "-f <timeout>" arg for forced shutdowns 
> ---------------------------------------------------------------------
>                 Key: HADOOP-8650
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8650
>             Project: Hadoop Common
>          Issue Type: Improvement
>    Affects Versions: 1.0.3, 2.2.0-alpha
>            Reporter: Steve Loughran
> Add a timeout for the daemon script to trigger a kill -9 if the clean shutdown fails.


