Date: Sat, 28 Mar 2015 06:12:53 +0000 (UTC)
From: "Rohith (JIRA)"
To: yarn-issues@hadoop.apache.org
Reply-To: yarn-issues@hadoop.apache.org
Subject: [jira] [Commented] (YARN-3410) YARN admin should be able to remove individual application records from RMStateStore

    [ https://issues.apache.org/jira/browse/YARN-3410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14385153#comment-14385153 ]

Rohith commented on YARN-3410:
------------------------------

Giving the admin the privilege to remove a single application entry from the state store is a good option. There may be other configurations that affect the RM upon restart; we may need to revisit or identify those configs.

bq. RM should be able to report all fatal errors (those that will shut down the RM) during app recovery; this can save the admin time when removing apps in a bad state.
To be clear, do you mean that fatal errors should be written to the logs or printed to the console?

> YARN admin should be able to remove individual application records from RMStateStore
> ------------------------------------------------------------------------------------
>
>                 Key: YARN-3410
>                 URL: https://issues.apache.org/jira/browse/YARN-3410
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager, yarn
>            Reporter: Wangda Tan
>            Assignee: Rohith
>            Priority: Critical
>
> When the RM state store enters an unexpected state (one example is YARN-2340, where an attempt is not in a final state but the app has already completed), the RM can never come up unless the RMStateStore is formatted.
> I think we should support removing individual application records from the RMStateStore to unblock the RM, so that the admin can choose between waiting for a fix and formatting the state store.
> In addition, RM should be able to report all fatal errors (those that will shut down the RM) during app recovery; this can save the admin time when removing apps in a bad state.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)