hadoop-yarn-dev mailing list archives

From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-4721) RM to try to auth with HDFS on startup, retry with max diagnostics on failure
Date Tue, 23 Feb 2016 13:27:18 GMT
Steve Loughran created YARN-4721:
------------------------------------

             Summary: RM to try to auth with HDFS on startup, retry with max diagnostics on failure
                 Key: YARN-4721
                 URL: https://issues.apache.org/jira/browse/YARN-4721
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: resourcemanager
    Affects Versions: 2.8.0
            Reporter: Steve Loughran


If the RM can't authenticate with HDFS, the failure may first surface during job submission,
which causes confusion about what's wrong and whose credentials are playing up.

Instead, the RM could try to talk to HDFS on launch; {{ls /}} should suffice. If it can't
authenticate, it can then tell UGI to log more and retry.
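
A minimal sketch of the probe-then-retry idea, not a proposed patch. The class name {{StartupAuthProbe}}, the method {{probeWithRetry}} and its parameters are all hypothetical. The probe and the "log more" step are passed in as callbacks so the sketch stays free of Hadoop dependencies; in the RM the probe would presumably be something like {{FileSystem.get(conf).listStatus(new Path("/"))}}, and the diagnostics hook would be whatever mechanism is chosen for raising UGI/Kerberos logging.

```java
import java.util.concurrent.Callable;

public class StartupAuthProbe {

    /**
     * Runs the probe up to maxAttempts times. After the first failure,
     * enableDiagnostics is invoked once so that later attempts run with
     * extra logging. Returns true if any attempt succeeded.
     * (Hypothetical helper: names and signature are assumptions.)
     */
    public static boolean probeWithRetry(Callable<Void> probe,
                                         Runnable enableDiagnostics,
                                         int maxAttempts) {
        boolean diagnosticsOn = false;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                probe.call();          // e.g. an "ls /" against HDFS
                return true;
            } catch (Exception e) {
                System.err.println("HDFS probe attempt " + attempt
                        + " failed: " + e);
                if (!diagnosticsOn) {
                    enableDiagnostics.run();   // turn logging up once
                    diagnosticsOn = true;
                }
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Stand-in probe that always fails, simulating an auth problem.
        Callable<Void> probe = () -> {
            throw new java.io.IOException("simulated auth failure");
        };
        // Stand-in for raising the UGI log level.
        Runnable diagnostics =
                () -> System.err.println("diagnostics enabled");
        boolean ok = probeWithRetry(probe, diagnostics, 2);
        System.out.println("probe succeeded: " + ok);
    }
}
```

What the caller does with a {{false}} result (fail fast, or keep retrying in the background) is deliberately left out of the sketch.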

I don't know what the policy should be if the RM can't auth to HDFS at this point. Certainly
it can't currently accept work. But should it fail fast or keep going in the hope that the
problem is in the KDC or NN and will fix itself without an RM restart?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
