hadoop-mapreduce-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-3030) RM is not processing heartbeat and continuously giving the message 'Node not found rebooting'
Date Tue, 20 Sep 2011 13:22:10 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-3030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108676#comment-13108676 ]

Hudson commented on MAPREDUCE-3030:
-----------------------------------

Integrated in Hadoop-Mapreduce-trunk #836 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/836/])
    MAPREDUCE-3030. Fixed a bug in NodeId.equals() that was causing RM to reject all NMs. Contributed by Devaraj K.

vinodkv : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1172638
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-api/src/main/java/org/apache/hadoop/yarn/api/records/NodeId.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/api/TestNodeId.java
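
For readers unfamiliar with this class of bug, here is a minimal sketch of how a faulty equals() on a map key can make a registered node look unknown on every heartbeat. It is an illustration only, not the actual NodeId code or the committed patch: SimpleNodeId, BrokenNodeId, heartbeatAccepted and the plain HashMap registry are hypothetical names, and the identity-only equals() is an assumed stand-in for whatever the real defect was.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

public class NodeIdEqualsSketch {

    // Hypothetical stand-in for a node identifier (host + port); not the real YARN NodeId.
    static class SimpleNodeId {
        final String host;
        final int port;

        SimpleNodeId(String host, int port) {
            this.host = host;
            this.port = port;
        }

        // Value-based equality: two ids are equal when host and port match.
        @Override
        public boolean equals(Object obj) {
            if (this == obj) return true;
            if (!(obj instanceof SimpleNodeId)) return false;
            SimpleNodeId other = (SimpleNodeId) obj;
            return port == other.port && Objects.equals(host, other.host);
        }

        @Override
        public int hashCode() {
            return Objects.hash(host, port);
        }
    }

    // Assumed example of a broken key: hashCode is value-based, but equals()
    // only accepts the very same instance, so a freshly built id never matches.
    static class BrokenNodeId {
        final String host;
        final int port;

        BrokenNodeId(String host, int port) {
            this.host = host;
            this.port = port;
        }

        @Override
        public boolean equals(Object obj) {
            return this == obj;
        }

        @Override
        public int hashCode() {
            return Objects.hash(host, port);
        }
    }

    // Hypothetical heartbeat check: a node is accepted only if its id is a key in the registry.
    static boolean heartbeatAccepted(Map<Object, String> registry, Object nodeId) {
        return registry.containsKey(nodeId);
    }

    public static void main(String[] args) {
        Map<Object, String> registry = new HashMap<>();

        // Registration stores one instance; each heartbeat arrives with a new instance.
        registry.put(new SimpleNodeId("10.18.52.124", 45454), "registered");
        registry.put(new BrokenNodeId("10.18.52.124", 45454), "registered");

        System.out.println("correct equals, heartbeat accepted: "
                + heartbeatAccepted(registry, new SimpleNodeId("10.18.52.124", 45454)));
        System.out.println("broken equals, heartbeat accepted: "
                + heartbeatAccepted(registry, new BrokenNodeId("10.18.52.124", 45454)));
    }
}
```

In this sketch the first lookup succeeds and the second fails even though both nodes were registered, which mirrors the symptom reported in the issue: registration succeeds, yet every later heartbeat is treated as coming from an unknown node.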


> RM is not processing heartbeat and continuously giving the message 'Node not found rebooting'
> ---------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3030
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3030
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, resourcemanager
>    Affects Versions: 0.23.0
>            Reporter: Devaraj K
>            Assignee: Devaraj K
>            Priority: Blocker
>             Fix For: 0.23.0, 0.24.0
>
>         Attachments: MAPREDUCE-3030.patch
>
>
> {code:title=Node Manager Logs|borderStyle=solid}
> 2011-09-19 13:39:29,816 INFO  webapp.WebApps (WebApps.java:start(162)) - Registered webapp guice modules
> 2011-09-19 13:39:29,817 INFO  service.AbstractService (AbstractService.java:start(61)) - Service:org.apache.hadoop.yarn.server.nodemanager.webapp.WebServer is started.
> 2011-09-19 13:39:29,818 INFO  service.AbstractService (AbstractService.java:start(61)) - Service:Dispatcher is started.
> 2011-09-19 13:39:29,819 INFO  nodemanager.NodeStatusUpdaterImpl (NodeStatusUpdaterImpl.java:start(133)) - Configured ContainerManager Address is 10.18.52.124:45454
> 2011-09-19 13:39:29,819 INFO  ipc.YarnRPC (YarnRPC.java:create(47)) - Creating YarnRPC for org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC
> 2011-09-19 13:39:29,822 INFO  ipc.HadoopYarnRPC (HadoopYarnProtoRPC.java:getProxy(49)) - Creating a HadoopYarnProtoRpc proxy for protocol interface org.apache.hadoop.yarn.server.api.ResourceTracker
> 2011-09-19 13:39:29,862 INFO  nodemanager.NodeStatusUpdaterImpl (NodeStatusUpdaterImpl.java:registerWithRM(165)) - Connected to ResourceManager at 0.0.0.0:8025
> 2011-09-19 13:39:30,369 INFO  nodemanager.NodeStatusUpdaterImpl (NodeStatusUpdaterImpl.java:registerWithRM(189)) - Registered with ResourceManager as 10.18.52.124:45454 with total resource of memory: 8192,
> 2011-09-19 13:39:30,369 INFO  service.AbstractService (AbstractService.java:start(61)) - Service:org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl is started.
> 2011-09-19 13:39:30,371 INFO  service.AbstractService (AbstractService.java:start(61)) - Service:org.apache.hadoop.yarn.server.nodemanager.NodeManager is started.
> {code}
> {code:title=Resource Manager Logs|borderStyle=solid}
> 2011-09-19 14:01:03,238 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 10.18.52.124:45454
> Call: protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface, method=nodeHeartbeat
> 2011-09-19 14:01:04,240 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 10.18.52.124:45454
> Call: protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface, method=nodeHeartbeat
> 2011-09-19 14:01:05,242 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 10.18.52.124:45454
> Call: protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface, method=nodeHeartbeat
> 2011-09-19 14:01:06,244 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 10.18.52.124:45454
> Call: protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface, method=nodeHeartbeat
> 2011-09-19 14:01:07,246 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 10.18.52.124:45454
> Call: protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface, method=nodeHeartbeat
> 2011-09-19 14:01:08,247 INFO  resourcemanager.ResourceTrackerService (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 10.18.52.124:45454
> {code}
> The Node Manager registers with the Resource Manager, but for every subsequent heartbeat the Resource Manager prints the above message.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
