hadoop-yarn-issues mailing list archives

From "Rohith (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-1686) NodeManager.resyncWithRM() does not handle exceptions, which causes the NodeManager to hang.
Date Wed, 05 Feb 2014 12:52:09 GMT
Rohith created YARN-1686:
----------------------------

             Summary: NodeManager.resyncWithRM() does not handle exceptions, which causes the
NodeManager to hang.
                 Key: YARN-1686
                 URL: https://issues.apache.org/jira/browse/YARN-1686
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.3.0
            Reporter: Rohith
            Assignee: Rohith


During NodeManager startup, if registration with the ResourceManager throws an exception,
the NodeManager shuts down.

Consider the case where NM-1 is already registered with the RM and the RM issues a Resync
to the NM. If any exception is thrown in "resyncWithRM" (which starts a new thread that does
not handle exceptions) during the RESYNC event, that thread is lost and the NodeManager hangs.
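
To illustrate the failure mode and one possible way to handle it, here is a minimal,
self-contained sketch. It is not the NodeManager code and not a proposed patch: the class
ResyncSketch, the Registration interface, and the shutdownRequested flag are all hypothetical
stand-ins. The assumption is that a failed re-registration during RESYNC should lead to the
same shutdown as a failed registration at startup, so the worker thread catches the failure
and records a shutdown request instead of dying silently.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;

public class ResyncSketch {
    /** Hypothetical stand-in for the re-registration work done on RESYNC. */
    interface Registration {
        void registerWithRM() throws Exception;
    }

    // Hypothetical flag; a real service would dispatch a shutdown event instead.
    private final AtomicBoolean shutdownRequested = new AtomicBoolean(false);
    private final CountDownLatch done = new CountDownLatch(1);

    /** Runs the re-registration on a new thread, as described in the report. */
    void resyncWithRM(Registration registration) {
        Thread worker = new Thread(() -> {
            try {
                registration.registerWithRM();
            } catch (Throwable t) {
                // Without this catch the thread would terminate and nothing
                // else would learn that the resync failed.
                shutdownRequested.set(true);
            } finally {
                done.countDown();
            }
        }, "resync-sketch");
        worker.start();
    }

    /** Waits for the resync thread to finish; returns false on timeout. */
    boolean awaitResync(long timeoutMs) throws InterruptedException {
        return done.await(timeoutMs, TimeUnit.MILLISECONDS);
    }

    boolean isShutdownRequested() {
        return shutdownRequested.get();
    }

    public static void main(String[] args) throws Exception {
        ResyncSketch nm = new ResyncSketch();
        // Simulate a registration failure during RESYNC.
        nm.resyncWithRM(() -> {
            throw new IllegalStateException("simulated registration failure");
        });
        nm.awaitResync(5000);
        System.out.println("shutdown requested: " + nm.isShutdownRequested());
    }
}
```

In this sketch the simulated failure is turned into a visible shutdown request rather than
a lost thread. Whether the actual fix should shut down, retry, or do something else is left
open here.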



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
