hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Krishna Kishore Bonagiri <write2kish...@gmail.com>
Subject Job running on YARN gets automatically killed after 10-12 minutes
Date Mon, 05 Nov 2012 16:32:31 GMT
Hi,

  My job that is running on YARN framework gets killed automatically after
10-12 minutes.

  I have changed the monitoring time limit Client.java that comes with
distributed shell example, and also bumped values for a set of interval
parameters in $HADOOP_CONF_DIR/yarn-site.xml by 10 fold. Then also the same
kind of error repeats.

Note: I am not sending frequent heartbeats to the RM from AM, also not
sending frequent container requests to RM.

Content from RM's log:
=====================


2012-11-05 05:50:41,721 INFO  fifo.FifoScheduler
(FifoScheduler.java:containerCompleted(721)) - Application
appattempt_1352112580456_0001_000001 released container
container_1352112580456_0001_01_000004 on node: host: isredeng:33055
#containers=2 available=4096 used=4096 with event: FINISHED
2012-11-05 06:03:03,855 INFO  util.AbstractLivelinessMonitor
(AbstractLivelinessMonitor.java:run(111)) -
Expired:appattempt_1352112580456_0001_000001 Timed out after 600 secs
2012-11-05 06:03:03,867 INFO  attempt.RMAppAttemptImpl
(RMAppAttemptImpl.java:handle(483)) - appattempt_1352112580456_0001_000001
State change from RUNNING to FAILED



Content from NM's log:
======================


2012-11-05 06:03:04,364 INFO  containermanager.AuxServices
(AuxServices.java:handle(160)) - Got event APPLICATION_STOP for appId
application_1352112580456_0001
2012-11-05 06:03:04,373 INFO  application.Application
(ApplicationImpl.java:handle(387)) - Application
application_1352112580456_0001 transitioned from
APPLICATION_RESOURCES_CLEANINGUP to FINISHED


Is this behavior not controllable by any of the parameters in XML
configuration files?

Thanks & Regards,
Kishore

Mime
View raw message