zookeeper-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexis Midon <alexismi...@gmail.com>
Subject Re: server cannot join quorum
Date Fri, 07 Jan 2011 18:32:45 GMT
Flavio, I guess you're refering to ZK-822 & 790 ? We're actually upgrading
the environment right now.

Here are more details below. Logs are attached.
I didn't take the 'connection refused' on 2888 as an error, since - afaik -
followers do not always open this port. I double-checked the security groups
setting with my sys admin as well.

## zoo.cfg
########################
tickTime=2000
initLimit=12
syncLimit=5

dataDir=/var/zk/data
dataLogDir=/var/zk/txlog
clientPort=2181

#minSessionTimeout=10000
maxSessionTimeout=900000

server.1=zk1:2888:3888
server.2=zk2:2888:3888
server.3=zk3:2888:3888

########################

> for f in {1..3}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"echo srvr | nc
127.0.0.1 2181";done
zk1 ---------
This ZooKeeper instance is not currently serving requests
zk2 ---------
Zookeeper version: 3.3.1-942149, built on 05/07/2010 17:14 GMT
Latency min/avg/max: 0/3/338
Received: 1738136
Sent: 1759599
Outstanding: 0
Zxid: 0x1000de84a
Mode: follower
Node count: 1041
zk3 ---------
Zookeeper version: 3.3.1-942149, built on 05/07/2010 17:14 GMT
Latency min/avg/max: 0/1/257
Received: 338889
Sent: 367196
Outstanding: 0
Zxid: 0x1000de84a
Mode: leader
Node count: 1041

## ZK 1,2 to ZK 3
###############
> for f in {1..2}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"telnet
zk3.prod2.i.c3-e.com 2888";done
zk1 ---------
Trying 10.96.42.54...
Connected to ec2-75-101-171-125.compute-1.amazonaws.com.
Escape character is '^]'.
^Czk2 ---------
Trying 10.96.42.54...
Connected to ec2-75-101-171-125.compute-1.amazonaws.com.
Escape character is '^]'.
> for f in {1..2}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"telnet
zk3.prod2.i.c3-e.com 3888";done
zk1 ---------
Trying 10.96.42.54...
Connected to ec2-75-101-171-125.compute-1.amazonaws.com.
Escape character is '^]'.
^Czk2 ---------
Trying 10.96.42.54...
Connected to ec2-75-101-171-125.compute-1.amazonaws.com.
Escape character is '^]'.


## ZK 2,3 to ZK 1
###############
> for f in {2..3}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"telnet
zk1.prod2.i.c3-e.com 2888";done
zk2 ---------
Trying 10.196.155.208...
telnet: Unable to connect to remote host: Connection refused
zk3 ---------
Trying 10.196.155.208...
telnet: Unable to connect to remote host: Connection refused
> for f in {2..3}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"telnet
zk1.prod2.i.c3-e.com 3888";done
zk2 ---------
Trying 10.196.155.208...
Connected to ec2-174-129-156-215.compute-1.amazonaws.com.
Escape character is '^]'.
^Czk3 ---------
Trying 10.196.155.208...
Connected to ec2-174-129-156-215.compute-1.amazonaws.com.
Escape character is '^]'.


## ZK 1,3 to ZK 2
###############
> for f in {1,3}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"telnet
zk2.prod2.i.c3-e.com 2888";done
zk1 ---------
Trying 10.97.29.58...
telnet: Unable to connect to remote host: Connection refused
zk3 ---------
Trying 10.97.29.58...
telnet: Unable to connect to remote host: Connection refused
> for f in {1,3}; do echo "zk$f --------- "; ssh zk$f.prod2.i.c3-e.com"telnet
zk2.prod2.i.c3-e.com 3888";done
zk1 ---------
Trying 10.97.29.58...
Connected to ec2-50-16-119-92.compute-1.amazonaws.com.
Escape character is '^]'.
^Czk3 ---------
Trying 10.97.29.58...
Connected to ec2-50-16-119-92.compute-1.amazonaws.com.
Escape character is '^]'.



On Thu, Jan 6, 2011 at 9:17 PM, Ted Dunning <ted.dunning@gmail.com> wrote:

> When you checked this did you actually connect to the peer ports from the
> different machines?  Or just ping from machine to machine?
>
> On Thu, Jan 6, 2011 at 4:32 PM, Alexis Midon <alexismidon@gmail.com>
> wrote:
>
> > the connectivity between the machines
>

Mime
  • Unnamed multipart/mixed (inline, None, 0 bytes)
View raw message