hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daryn Sharp (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-8865) RMStateStore contains large number of expired RMDelegationToken
Date Wed, 10 Oct 2018 20:51:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-8865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16645520#comment-16645520
] 

Daryn Sharp commented on YARN-8865:
-----------------------------------

The RMDelegationTokenSecretManager is an AbstractDelegationTokenSecretManager.  The ADTSM
uses a thread to periodically roll secret keys and purge expired tokens.  We checked some
clusters that use the level db state store and we're not leaking tokens which implies the
problem is likely specific to the ZKRMStateStore.

Given it's the ADTSM's job to expunge expired tokens, every state store impl should not be
burdened with duplicated code to explicitly purge tokens just because one state store impl
is buggy.

> RMStateStore contains large number of expired RMDelegationToken
> ---------------------------------------------------------------
>
>                 Key: YARN-8865
>                 URL: https://issues.apache.org/jira/browse/YARN-8865
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 3.1.0
>            Reporter: Wilfred Spiegelenburg
>            Assignee: Wilfred Spiegelenburg
>            Priority: Major
>         Attachments: YARN-8865.001.patch
>
>
> When the RM state store is restored expired delegation tokens are restored and added
to the system. These expired tokens do not get cleaned up or removed. The exact reason why
the tokens are still in the store is not clear. We have seen as many as 250,000 tokens in
the store some of which were 2 years old.
> This has two side effects:
> * for the zookeeper store this leads to a jute buffer exhaustion issue and prevents the
RM from becoming active.
> * restore takes longer than needed and heap usage is higher than it should be
> We should not restore already expired tokens since they cannot be renewed or used.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message