Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8BCA1200BAA for ; Thu, 27 Oct 2016 20:34:04 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 874C1160B05; Thu, 27 Oct 2016 18:34:04 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id D1E49160AF6 for ; Thu, 27 Oct 2016 20:34:03 +0200 (CEST) Received: (qmail 122 invoked by uid 500); 27 Oct 2016 18:34:00 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 99634 invoked by uid 99); 27 Oct 2016 18:34:00 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Oct 2016 18:34:00 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 532062C2A67 for ; Thu, 27 Oct 2016 18:34:00 +0000 (UTC) Date: Thu, 27 Oct 2016 18:34:00 +0000 (UTC) From: "Karthik Kambatla (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (YARN-4721) RM to try to auth with HDFS on startup, retry with max diagnostics on failure MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 27 Oct 2016 18:34:04 -0000 [ https://issues.apache.org/jira/browse/YARN-4721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthik Kambatla updated YARN-4721: ----------------------------------- Component/s: security > RM to try to auth with HDFS on startup, retry with max diagnostics on failure > ----------------------------------------------------------------------------- > > Key: YARN-4721 > URL: https://issues.apache.org/jira/browse/YARN-4721 > Project: Hadoop YARN > Issue Type: Improvement > Components: resourcemanager, security > Affects Versions: 2.8.0 > Reporter: Steve Loughran > Assignee: Steve Loughran > Labels: oct16-medium > Attachments: HADOOP-12289-002.patch, HADOOP-12289-003.patch, HADOOP-12889-001.patch > > > If the RM can't auth with HDFS, this can first surface during job submission, which can cause confusion about what's wrong and whose credentials are playing up. > Instead, the RM could try to talk to HDFS on launch, {{ls /}} should suffice. If it can't auth, it can then tell UGI to log more and retry. > I don't know what the policy should be if the RM can't auth to HDFS at this point. Certainly it can't currently accept work. But should it fail fast or keep going in the hope that the problem is in the KDC or NN and will fix itself without an RM restart? -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: yarn-issues-help@hadoop.apache.org