ambari-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Onischuk (JIRA)" <j...@apache.org>
Subject [jira] [Created] (AMBARI-17646) Nodemanager is not started after installation
Date Mon, 11 Jul 2016 09:23:11 GMT
Andrew Onischuk created AMBARI-17646:
----------------------------------------

             Summary: Nodemanager is not started after installation
                 Key: AMBARI-17646
                 URL: https://issues.apache.org/jira/browse/AMBARI-17646
             Project: Ambari
          Issue Type: Bug
            Reporter: Andrew Onischuk
            Assignee: Andrew Onischuk
             Fix For: 2.4.0
         Attachments: AMBARI-17646.patch

Nodemanager is down on one of the nodes after installation. This has impacted
most of the splits in todays run (ambari-2.4.0.0-817).  
Nodemanager is found be down on one of the nodes in 3 node cluster and its
running on other two nodes.  
Live cluster is available here <https://172.22.66.85:8443/> and is alive for
another 24hrs

Below error is seen in nodemanager.log :

2016-07-10 04:40:59,678 INFO recovery.NMLeveldbStateStoreService
(NMLeveldbStateStoreService.java:checkVersion(1022)) - Loaded NM state version
info 1.0  
2016-07-10 04:40:59,889 WARN nodemanager.LinuxContainerExecutor
(LinuxContainerExecutor.java:init(195)) - Exit code from container executor
initialization is : 24  
ExitCodeException exitCode=24: File /etc/hadoop/2.4.2.0-258/0 must be owned by
root, but is owned by 2530

at org.apache.hadoop.util.Shell.runCommand(Shell.java:576)  
at org.apache.hadoop.util.Shell.run(Shell.java:487)  
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:753)  
at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(Linux
ContainerExecutor.java:192)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManag
er.java:236)  
at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManag
er(NodeManager.java:547)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java
:595)  
2016-07-10 04:40:59,893 INFO nodemanager.ContainerExecutor
(ContainerExecutor.java:logOutput(322)) -  
2016-07-10 04:40:59,893 INFO service.AbstractService
(AbstractService.java:noteFailure(272)) - Service NodeManager failed in state
INITED; cause: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed
to initialize container executor  
org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to initialize
container executor  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManag
er.java:238)  
at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManag
er(NodeManager.java:547)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java
:595)  
Caused by: java.io.IOException: Linux container executor not configured
properly (error=24)  
at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(Linux
ContainerExecutor.java:198)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManag
er.java:236)  
... 3 more  
Caused by: ExitCodeException exitCode=24: File /etc/hadoop/2.4.2.0-258/0 must
be owned by root, but is owned by 2530

at org.apache.hadoop.util.Shell.runCommand(Shell.java:576)  
at org.apache.hadoop.util.Shell.run(Shell.java:487)  
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:753)  
at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(Linux
ContainerExecutor.java:192)  
... 4 more  
2016-07-10 04:40:59,895 FATAL nodemanager.NodeManager
(NodeManager.java:initAndStartNodeManager(550)) - Error starting NodeManager  
org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to initialize
container executor  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManag
er.java:238)  
at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManag
er(NodeManager.java:547)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java
:595)  
Caused by: java.io.IOException: Linux container executor not configured
properly (error=24)  
at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(Linux
ContainerExecutor.java:198)  
at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManag
er.java:236)  
... 3 more  
Caused by: ExitCodeException exitCode=24: File /etc/hadoop/2.4.2.0-258/0 must
be owned by root, but is owned by 2530

at org.apache.hadoop.util.Shell.runCommand(Shell.java:576)  
at org.apache.hadoop.util.Shell.run(Shell.java:487)  
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:753)  
at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(Linux
ContainerExecutor.java:192)  
... 4 more  
2016-07-10 04:40:59,898 INFO nodemanager.NodeManager
(LogAdapter.java:info(45)) - SHUTDOWN_MSG:  
/************************************************************  
SHUTDOWN_MSG: Shutting down NodeManager at nat-d7-xals-ambarieu-
newamb-242-1-1/172.22.66.62





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message