hadoop-hdfs-issues mailing list archives

From "Chao Sun (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-14528) [SBN Read]Failover from Active to Standby Failed
Date Tue, 04 Jun 2019 20:06:00 GMT

    [ https://issues.apache.org/jira/browse/HDFS-14528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16856080#comment-16856080 ]

Chao Sun commented on HDFS-14528:
---------------------------------

I think the current fix is on the right track. Just two comments, as mentioned earlier:
1. We need a unit test (UT) for this.
2. It might be more appropriate to do the observer check in {{ZKFailoverController#doCedeActive}},
as that can save one RPC call for each NN in the candidate set.

We may also need a separate JIRA for the connection issue: the unavailability of one NN
in a multi-SBN environment shouldn't cause failover to fail.
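
As a rough illustration of the behaviour discussed above, here is a minimal, self-contained sketch. It is not the patch and does not use Hadoop classes: {{FailoverSketch}}, {{Node}}, {{State}}, {{queryState}} and {{eligibleTargets}} are all hypothetical names, and the RPC is simulated. It only shows the intended outcome, assuming that observers should never be picked as failover targets and that one unreachable NN should be skipped rather than aborting the whole operation.

```java
import java.io.IOException;
import java.net.ConnectException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these types are Hadoop classes.
class FailoverSketch {
    enum State { ACTIVE, STANDBY, OBSERVER }

    /** Stand-in for a NameNode that may or may not be reachable. */
    static final class Node {
        final String id;
        final State state;
        final boolean reachable;

        Node(String id, State state, boolean reachable) {
            this.id = id;
            this.state = state;
            this.reachable = reachable;
        }
    }

    /** Simulated state query: fails the way a refused connection would. */
    static State queryState(Node n) throws IOException {
        if (!n.reachable) {
            throw new ConnectException("Connection refused: " + n.id);
        }
        return n.state;
    }

    /** Returns the ids of candidates that could become active. */
    static List<String> eligibleTargets(List<Node> candidates) {
        List<String> targets = new ArrayList<>();
        for (Node n : candidates) {
            try {
                // Only a standby qualifies; observers and the active are skipped.
                if (queryState(n) == State.STANDBY) {
                    targets.add(n.id);
                }
            } catch (IOException e) {
                // One unreachable NN is ignored instead of failing the failover.
            }
        }
        return targets;
    }

    public static void main(String[] args) {
        List<Node> nodes = new ArrayList<>();
        nodes.add(new Node("nn1", State.ACTIVE, true));
        nodes.add(new Node("nn2", State.STANDBY, true));
        nodes.add(new Node("nn3", State.OBSERVER, false)); // down observer
        System.out.println(eligibleTargets(nodes));
    }
}
```

In this sketch the down observer is simply left out of the target list, which is the outcome the failover command in the report was expected to tolerate.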

> [SBN Read]Failover from Active to Standby Failed  
> --------------------------------------------------
>
>                 Key: HDFS-14528
>                 URL: https://issues.apache.org/jira/browse/HDFS-14528
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha
>            Reporter: Ravuri Sushma sree
>            Assignee: Ravuri Sushma sree
>            Priority: Major
>         Attachments: ZKFC_issue.patch
>
>
> *Started an HA cluster with three nodes [ _Active, Standby, Observer_ ]*
> *When trying to execute the failover command from active to standby,*
> *_./hdfs haadmin -failover nn1 nn2_, the exception below is thrown:*
>   Operation failed: Call From X-X-X-X/X-X-X-X to Y-Y-Y-Y:nnnn failed on connection exception: java.net.ConnectException: Connection refused; For more details see: [http://wiki.apache.org/hadoop/ConnectionRefused]
>  at sun.reflect.GeneratedConstructorAccessor7.newInstance(Unknown Source)
>  at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>  at java.lang.reflect.Constructor.newInstance(Constructor.java:422)
>  at org.apache.hadoop.net.NetUtils.wrapWithMessage(NetUtils.java:831)
>  at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:755) 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

