hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rohith (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.
Date Tue, 21 Oct 2014 03:36:34 GMT

    [ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177886#comment-14177886

Rohith commented on YARN-2579:

bq. Under what conditions, can resetDispatcher be called by two threads simultaneously? 
resetDispatcher is called only once in synchronized block(transitionToStandBy or transitinedToActive).

Here the problem is , 
*Thread-1 :* just before stoppingActiveServices() from trainsitionToStandBy() method if RMFatalEvent
is thrown then RMFatalEventDispatcher wait for trainsitionToStandBy() for obtaining lock.RMFatalEventDispatcher
is BLOCKED on trainsitionToStandBy().
*Thread-2 :* From the elector, trainsitionedTotandBy() stops dispatcher in resetDispatcher()
method. (Service)Dispatcher.stop() wait for draining out RMFatalEventDispatcher event.But
"AsyncDispatcher event handler" is WAITING on dispatcher thread to finish.

> Both RM's state is Active , but 1 RM is not really active.
> ----------------------------------------------------------
>                 Key: YARN-2579
>                 URL: https://issues.apache.org/jira/browse/YARN-2579
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.5.1
>            Reporter: Rohith
>            Assignee: Rohith
>         Attachments: YARN-2579.patch, YARN-2579.patch
> I encountered a situaltion where both RM's web page was able to access and its state
displayed as Active. But One of the RM's ActiveServices were stopped.

This message was sent by Atlassian JIRA

View raw message