hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HADOOP-14855) Hadoop scripts may errantly believe a daemon is still running, preventing it from starting
Date Fri, 08 Sep 2017 18:22:00 GMT
Aaron T. Myers created HADOOP-14855:
---------------------------------------

             Summary: Hadoop scripts may errantly believe a daemon is still running, preventing
it from starting
                 Key: HADOOP-14855
                 URL: https://issues.apache.org/jira/browse/HADOOP-14855
             Project: Hadoop Common
          Issue Type: Bug
          Components: scripts
    Affects Versions: 3.0.0-alpha4
            Reporter: Aaron T. Myers


I encountered a case recently where the NN wouldn't start, with the error message "namenode
is running as process 16769.  Stop it first." In fact the NN was not running at all, but rather
another long-running process was running with this pid.

It looks to me like our scripts just check to see if _any_ process is running with the pid
that the NN (or any Hadoop daemon) most recently ran with. This is clearly not a fool-proof
way of checking to see if a particular type of daemon is now running, as some other process
could start running with the same pid since the daemon in question was previously shut down.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org


Mime
View raw message