Return-Path: X-Original-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C577910690 for ; Mon, 30 Sep 2013 21:26:36 +0000 (UTC) Received: (qmail 65503 invoked by uid 500); 30 Sep 2013 21:26:31 -0000 Delivered-To: apmail-hadoop-yarn-issues-archive@hadoop.apache.org Received: (qmail 65313 invoked by uid 500); 30 Sep 2013 21:26:29 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-issues@hadoop.apache.org Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 65178 invoked by uid 99); 30 Sep 2013 21:26:26 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Sep 2013 21:26:26 +0000 Date: Mon, 30 Sep 2013 21:26:26 +0000 (UTC) From: "Arpit Gupta (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (YARN-1255) RM fails to start up with Failed to load/recover state error in a HA setup MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Arpit Gupta created YARN-1255: --------------------------------- Summary: RM fails to start up with Failed to load/recover state error in a HA setup Key: YARN-1255 URL: https://issues.apache.org/jira/browse/YARN-1255 Project: Hadoop YARN Issue Type: Bug Components: resourcemanager Affects Versions: 2.1.1-beta Reporter: Arpit Gupta {code} 2013-09-30 09:12:09,206 INFO capacity.CapacityScheduler (CapacityScheduler.java:parseQueue(408)) - Initialized queue: default: capacity=1.0, absoluteCapacity=1.0, usedResources=usedCapacity=0.0, absoluteUsedCapacity=0.0, numApps=0, numContainers=0 2013-09-30 09:12:09,206 INFO capacity.CapacityScheduler (CapacityScheduler.java:parseQueue(408)) - Initialized queue: root: numChildQueue= 1, capacity=1.0, absoluteCapacity=1.0, usedResources=usedCapacity=0.0, numApps=0, numContainers=0 2013-09-30 09:12:09,206 INFO capacity.CapacityScheduler (CapacityScheduler.java:initializeQueues(306)) - Initialized root queue root: numChildQueue= 1, capacity=1.0, absoluteCapacity=1.0, usedResources=usedCapacity=0.0, numApps=0, numContainers=0 2013-09-30 09:12:09,206 INFO capacity.CapacityScheduler (CapacityScheduler.java:reinitialize(270)) - Initialized CapacityScheduler with calculator=class org.apache.hadoop.yarn.util.resource.DefaultResourceCalculator, minimumAllocation=<>, maximumAllocation=<> 2013-09-30 09:12:09,240 INFO event.AsyncDispatcher (AsyncDispatcher.java:register(157)) - Registering class org.apache.hadoop.yarn.server.resourcemanager.RMAppManagerEventType for class org.apache.hadoop.yarn.server.resourcemanager.RMAppManager 2013-09-30 09:12:09,250 INFO event.AsyncDispatcher (AsyncDispatcher.java:register(157)) - Registering class org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncherEventType for class org.apache.hadoop.yarn.server.resourcemanager.amlauncher.ApplicationMasterLauncher 2013-09-30 09:12:09,252 INFO resourcemanager.RMNMInfo (RMNMInfo.java:(63)) - Registered RMNMInfo MBean 2013-09-30 09:12:09,253 INFO util.HostsFileReader (HostsFileReader.java:refresh(84)) - Refreshing hosts (include/exclude) list 2013-09-30 09:12:09,278 INFO security.UserGroupInformation (UserGroupInformation.java:loginUserFromKeytab(843)) - Login successful for user rm/hostname@realm using keytab file /etc/security/keytabs/rm.service.keytab 2013-09-30 09:12:09,278 INFO security.RMContainerTokenSecretManager (RMContainerTokenSecretManager.java:rollMasterKey(103)) - Rolling master-key for container-tokens 2013-09-30 09:12:09,279 INFO security.AMRMTokenSecretManager (AMRMTokenSecretManager.java:rollMasterKey(107)) - Rolling master-key for amrm-tokens 2013-09-30 09:12:09,281 INFO security.NMTokenSecretManagerInRM (NMTokenSecretManagerInRM.java:rollMasterKey(97)) - Rolling master-key for nm-tokens 2013-09-30 09:12:10,196 INFO recovery.FileSystemRMStateStore (FileSystemRMStateStore.java:loadRMAppState(131)) - Loading application from node: application_1380531989689_0002 2013-09-30 09:12:10,217 INFO recovery.FileSystemRMStateStore (FileSystemRMStateStore.java:loadRMAppState(131)) - Loading application from node: application_1380531989689_0003 2013-09-30 09:12:10,232 INFO security.RMDelegationTokenSecretManager (RMDelegationTokenSecretManager.java:recover(181)) - recovering RMDelegationTokenSecretManager. 2013-09-30 09:12:10,234 INFO resourcemanager.RMAppManager (RMAppManager.java:recover(329)) - Recovering 2 applications 2013-09-30 09:12:10,234 ERROR resourcemanager.ResourceManager (ResourceManager.java:serviceStart(640)) - Failed to load/recover state java.lang.NullPointerException at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.recover(RMAppManager.java:332) at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.recover(ResourceManager.java:842) at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:636) at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193) at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:855) 2013-09-30 09:12:10,236 INFO util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1 2013-09-30 09:17:20,144 INFO resourcemanager.ResourceManager (StringUtils.java:startupShutdownMessage(601)) - STARTUP_MSG: {code} -- This message was sent by Atlassian JIRA (v6.1#6144)