zookeeper-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ZOOKEEPER-3398) Learner.connectToLeader() may take too long to time-out
Date Fri, 12 Jul 2019 18:06:00 GMT

    [ https://issues.apache.org/jira/browse/ZOOKEEPER-3398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16884054#comment-16884054
] 

Hudson commented on ZOOKEEPER-3398:
-----------------------------------

SUCCESS: Integrated in Jenkins build Zookeeper-trunk-single-thread #447 (See [https://builds.apache.org/job/Zookeeper-trunk-single-thread/447/])
ZOOKEEPER-3398: Learner.connectToLeader() may take too long to time-out (andor: rev 43ce772db000721546fcd13dd8523002dfa97741)
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/HierarchicalQuorumTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/TruncateTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/FLEZeroWeightTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/QuorumBase.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/QuorumUtil.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/FLERestartTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/FLEOutOfElectionTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/FLEPredicateTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/CnxManagerTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/FLELostMessageTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/QuorumPeerTest.java
* (edit) zookeeper-server/src/main/java/org/apache/zookeeper/server/quorum/QuorumPeerConfig.java
* (edit) zookeeper-it/src/test/java/org/apache/zookeeper/test/system/BaseSysTest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/FLENewEpochTest.java
* (edit) zookeeper-server/src/main/java/org/apache/zookeeper/server/quorum/QuorumPeer.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/test/FLETest.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/LearnerTest.java
* (edit) zookeeper-server/src/main/java/org/apache/zookeeper/server/quorum/QuorumPeerMain.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/FLEBackwardElectionRoundTest.java
* (edit) zookeeper-server/src/main/java/org/apache/zookeeper/server/quorum/Learner.java
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/ReconfigDuringLeaderSyncTest.java
* (edit) zookeeper-it/src/test/java/org/apache/zookeeper/test/system/QuorumPeerInstance.java
* (edit) zookeeper-docs/src/main/resources/markdown/zookeeperAdmin.md
* (edit) zookeeper-server/src/test/java/org/apache/zookeeper/server/quorum/QuorumPeerTestBase.java


> Learner.connectToLeader() may take too long to time-out 
> --------------------------------------------------------
>
>                 Key: ZOOKEEPER-3398
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3398
>             Project: ZooKeeper
>          Issue Type: Improvement
>          Components: leaderElection, quorum
>            Reporter: Vladimir Ivić
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 3.6.0
>
>          Time Spent: 5h
>  Remaining Estimate: 0h
>
> After leader election happens, the followers will connect to the leader which is facilitated
by the Learner.connectToLeader() method. 
> Learner.connectToLeader() is relying on the initLimit configuration value to time-out
in case the network connection is unreliable. This config may have a high value that could
leave the ensemble retrying and waiting in the state of not having quorum for too long. The
follower will retry up to 5 times. 
> This patch introduces a new configuration directive that will allow Zookeeper to use
different time-out value `connectToLeaderLimit` which then could be set to lower value than
`initLimit`.
>  



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Mime
View raw message