hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From manoj <manojm....@gmail.com>
Subject App Master takes ~30min to re-schedule task attempts.
Date Wed, 19 Aug 2015 17:40:41 GMT
Hello all,

I'm running Apache2.6.0.
I'm trying to remove a node from a Hadoop Cluster and the add it back.
The taskattempts on the node which was removed are rescheduled only after

During this 30min period looks like the App Master is trying to connect(
check the log below ) the same node which was removed and after about 30min
it reschedules those taskAttempts from the lost node and eventually the job

how can I reduce the 30min wait time?

2015-08-14 11:25:21,662 INFO [ContainerLauncher #7]
org.apache.hadoop.ipc.Client: Retrying connect to server:
host172/XX.XX.XX.XX:36158. Already tried 0 time(s); retry policy is
RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000

--Manoj Kumar M

View raw message