hadoop-common-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-11000) HAServiceProtocol's health state is incorrectly transitioned to SERVICE_NOT_RESPONDING
Date Tue, 17 Feb 2015 14:33:13 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-11000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14324232#comment-14324232 ]

Hudson commented on HADOOP-11000:
---------------------------------

SUCCESS: Integrated in Hadoop-Hdfs-trunk #2039 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/2039/])
HADOOP-11000. HAServiceProtocol's health state is incorrectly transitioned to SERVICE_NOT_RESPONDING
(Contributed by Ming Ma) (vinayakumarb: rev cf4b7f506dd338ecf2ed4c643b6a6a334e070fca)
* hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ha/TestHealthMonitor.java
* hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ha/DummyHAService.java
* hadoop-common-project/hadoop-common/CHANGES.txt
* hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ha/HealthMonitor.java


> HAServiceProtocol's health state is incorrectly transitioned to SERVICE_NOT_RESPONDING
> --------------------------------------------------------------------------------------
>
>                 Key: HADOOP-11000
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11000
>             Project: Hadoop Common
>          Issue Type: Bug
>            Reporter: Ming Ma
>            Assignee: Ming Ma
>             Fix For: 2.7.0
>
>         Attachments: HADOOP-11000-2.patch, HADOOP-11000.patch
>
>
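> When HAServiceProtocol.monitorHealth throws a HealthCheckFailedException, the exception that
> actually arrives from the protocol buffer RPC layer is a RemoteException that wraps the real one.
> The catch clause for HealthCheckFailedException therefore never matches, the generic Throwable
> handler runs instead, and the state is incorrectly transitioned to SERVICE_NOT_RESPONDING
> rather than SERVICE_UNHEALTHY.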
> {noformat}
> HealthMonitor.java
> doHealthChecks
>       try {
>         status = proxy.getServiceStatus();
>         proxy.monitorHealth();
>         healthy = true;
>       } catch (HealthCheckFailedException e) {
>         .....
>         enterState(State.SERVICE_UNHEALTHY);
>       } catch (Throwable t) {
>         .....
>         enterState(State.SERVICE_NOT_RESPONDING);
>         .....
>       }
> {noformat}
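
The snippet quoted above misclassifies a wrapped health-check failure. The following is a minimal sketch of one way to classify it correctly, not the committed patch. It catches the failure generically and checks whether a RemoteException carries the class name of HealthCheckFailedException. The nested exception classes, the Proxy interface, the reduced State enum, and the helpers isHealthCheckFailure and check are hypothetical stand-ins so the example compiles without Hadoop on the classpath. The real org.apache.hadoop.ipc.RemoteException provides getClassName() and unwrapRemoteException(...) for this kind of check.

```java
import java.io.IOException;

public class HealthCheckSketch {

    // Reduced stand-in for HealthMonitor.State (the real enum has more values).
    enum State { SERVICE_HEALTHY, SERVICE_UNHEALTHY, SERVICE_NOT_RESPONDING }

    // Stand-in for org.apache.hadoop.ha.HealthCheckFailedException.
    static class HealthCheckFailedException extends IOException {
        HealthCheckFailedException(String msg) { super(msg); }
    }

    // Simplified stand-in for org.apache.hadoop.ipc.RemoteException:
    // it only records the class name of the exception thrown on the server.
    static class RemoteException extends IOException {
        private final String className;
        RemoteException(String className, String msg) {
            super(msg);
            this.className = className;
        }
        String getClassName() { return className; }
    }

    // Hypothetical, trimmed-down view of the HA service proxy.
    interface Proxy {
        void monitorHealth() throws IOException;
    }

    // True if t is a health-check failure, either thrown directly
    // or wrapped by the RPC layer in a RemoteException.
    static boolean isHealthCheckFailure(Throwable t) {
        if (t instanceof HealthCheckFailedException) {
            return true;
        }
        return t instanceof RemoteException
            && HealthCheckFailedException.class.getName()
                   .equals(((RemoteException) t).getClassName());
    }

    // Runs one health check and returns the state the monitor should enter.
    static State check(Proxy proxy) {
        try {
            proxy.monitorHealth();
            return State.SERVICE_HEALTHY;
        } catch (Throwable t) {
            return isHealthCheckFailure(t)
                ? State.SERVICE_UNHEALTHY
                : State.SERVICE_NOT_RESPONDING;
        }
    }

    public static void main(String[] args) {
        String wrapped = HealthCheckFailedException.class.getName();
        // Wrapped health-check failure: service answered but reported itself unhealthy.
        System.out.println(check(() -> { throw new RemoteException(wrapped, "disk full"); }));
        // Any other failure: treated as the service not responding.
        System.out.println(check(() -> { throw new IOException("connection refused"); }));
    }
}
```

With this classification, a service that answers the RPC but reports itself unhealthy lands in SERVICE_UNHEALTHY, and only failures that are not health-check failures fall through to SERVICE_NOT_RESPONDING.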



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
