[ https://issues.apache.org/jira/browse/YARN-674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13821586#comment-13821586
]
Vinod Kumar Vavilapalli commented on YARN-674:
----------------------------------------------
More comments, hopefully the final round.
- RMAppManager.submitApplication: Put a comment where you move apps to finish state saying
we are doing this before token-renewal so that we don't renew tokens for finished apps.
- DelegationTokenRenewer
-- isServiceStarted needs to be volatile?
-- handleDTRenewerEvent -> handleDTRenewerAppSubmitEvent
-- Add a comment in handleDTRenewerEvent to indicate why DTRenewer is starting the app
as opposed to RMAppManager.
-- applicationFinished(DelegationTokenRenewerEvent evt) -> handleAppFinishEvent and
similarly addApplication(DelegationTokenRenewerAppSubmitEvent evt) -> handleAppsubmitEvent?
-- Instead of putting renewerCount in the main code path, you can access the thread count
from ThreadPoolExecutor.getPoolSize() in the tests directly ?
- DelegationTokenRenewerAppSubmitEvent can be nested class inside DelegationTokenRenewer?
This is not an event from outside the renewer. Similarly DelegationTokenRenewerEventType.
Either nest them in, or create a separate package.
- testInvalidDelegationTokenApplicationSubmit, testInvalidDTWithAddApplication: Seem similar
but test different things. May be rename one or both?
> Slow or failing DelegationToken renewals on submission itself make RM unavailable
> ---------------------------------------------------------------------------------
>
> Key: YARN-674
> URL: https://issues.apache.org/jira/browse/YARN-674
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: resourcemanager
> Reporter: Vinod Kumar Vavilapalli
> Assignee: Omkar Vinit Joshi
> Attachments: YARN-674.1.patch, YARN-674.2.patch, YARN-674.3.patch, YARN-674.4.patch,
YARN-674.5.patch, YARN-674.5.patch, YARN-674.6.patch
>
>
> This was caused by YARN-280. A slow or a down NameNode for will make it look like RM
is unavailable as it may run out of RPC handlers due to blocked client submissions.
--
This message was sent by Atlassian JIRA
(v6.1#6144)
|