Mailing-List: contact user-help@curator.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@curator.apache.org
Received-SPF: pass (nike.apache.org: local policy includes SPF record at
 spf.trusted-forwarder.org)
MIME-Version: 1.0
In-Reply-To: <etPan.5384ea7c.74b0dc51.11f@Jordans-MacBook-Pro.local>
References: 
 <CAJb9fpRourQ-fJKCrMfLC2RsQuw+QBQgn9GidthBtRuPRsc4HA@mail.gmail.com>
 <etPan.5384ea7c.74b0dc51.11f@Jordans-MacBook-Pro.local>
From: =?UTF-8?Q?Mathias_S=C3=B6derberg?= <mathias@burtcorp.com>
Date: Tue, 27 May 2014 22:11:40 +0200
Message-ID: 
 <CAJb9fpQxTmKGPc=_fc9hy3YK9rg31=SGp4iE88d2vi2SWMa+zw@mail.gmail.com>
Subject: Re: LeaderLatch recipe and error handling
To: Jordan Zimmerman <jordan@jordanzimmerman.com>
Cc: user <user@curator.apache.org>
Content-Type: multipart/alternative; boundary=001a11c1641e020af504fa674fee

--001a11c1641e020af504fa674fee
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Right, that=E2=80=99s what I assumed after I actually read the code for the
LeaderLatch class.

We=E2=80=99re not using await(), but have a number of LeaderLatches and cur=
rently
we=E2=80=99re caching the last response of getLeader() (for a each LeaderLa=
tch),
and we add watches for the election paths and update the cache if we get a
NodeChildrenChanged notification.
When we get a LOST event followed by a RECONNECTED event we clear the cache
and start over as we have no clue who=E2=80=99s responsible for what. If we=
 get a
SUSPENDED event we don=E2=80=99t permit reads from the cache until we get a
RECONNECTED event (or rather we return null as we cannot be sure who=E2=80=
=99s
leader).

Perhaps we should clear the cache when we get a SUSPENDED event as well, to
be on the safe side.

But in conclusion there=E2=80=99s no need to actually close and re-create
LeaderLatches in case of a connection loss, which is really what I was
wondering about.

Best regards,
Mathias


On Tue, May 27, 2014 at 9:41 PM, Jordan Zimmerman <
jordan@jordanzimmerman.com> wrote:

> The documentation probably needs updating as this has been refined over
> time.
>
>    - The LeaderLatch installs its own connection state listener
>    - If the connection drops (SUSPENDED or LOST), the LeaderLatch changes
>    its internal state to =E2=80=9Cleader =3D=3D false=E2=80=9D
>    - If the connection goes to RECONNECTED, the LeaderLatch will attempt
>    to regain leadership
>
> This has implications for users of LeaderLatch. If, for example you've
> called await() on the LeaderLatch your code will assume that it is the
> leader. However, if the connection drops you may no longer be the leader.
> So, clients should install their own ConnectionStateListener and notice
> that the connection has dropped. Also, you can examine
> LeaderLatch.hasLeadership() before your client code does anything where i=
t
> assumes it is the leader and then periodically re-check it.
>
> I hope this helps.
>
> -JZ
>
>
> From: Mathias S=C3=B6derberg mathias@burtcorp.com
> Reply: user@curator.apache.org user@curator.apache.org
> Date: May 27, 2014 at 2:33:21 PM
> To: user@curator.apache.org user@curator.apache.org
> Subject:  LeaderLatch recipe and error handling
>
>  Good evening,
>
> I=E2=80=99m currently working on a project where we=E2=80=99re utilising =
Curator and more
> specifically (quite heavily) the LeaderLatch recipe.
>
> The documentation for error handling in =E2=80=9Cgeneral=E2=80=9D states =
the following for
> a LOST notification:
>
>  The connection is confirmed to be lost. Close any locks, leaders, etc.
> and attempt to re-create them. NOTE: it is possible to get a RECONNECTED
> state after this but you should still consider any locks, etc. as
> dirty/unstable.
>
>  And the documentation for the LeaderLatch recipe states the following:
>
>  LeaderLatch instances add a ConnectionStateListener to watch for
> connection problems. If SUSPENDED or LOST is reported, the LeaderLatch th=
at
> is the leader will report that it is no longer the leader (i.e. there wil=
l
> not be a leader until the connection is re-established). If a LOST
> connection is RECONNECTED, the LeaderLatch will delete its previous ZNode
> and create a new one.
>
> Users of LeaderLatch must take account that connection issues can cause
> leadership to be lost. i.e. hasLeadership() returns true but some time
> later the connection is SUSPENDED or LOST. At that point hasLeadership()
> will return false. It is highly recommended that LeaderLatch users regist=
er
> a ConnectionStateListener.
>
>  My conclusion from reading these two sections is that we=E2=80=99re supp=
osed to
> add a ConnectionStateListener and when we=E2=80=99re notified of a LOST e=
vent
> followed by a RECONNECTED event, we=E2=80=99re supposed to close the curr=
ent
> LeaderLatches that we=E2=80=99re holding and re-create them?
>
> However, looking through the actual code for the LeaderLatch, it appears
> that this is actually already handled, i.e. it appears to create a new
> znode when it encounters a RECONNECTED event, or am I reading this wrong?
> (The documentation also states this as a fact).
>
> My question is really: do we have to take any particular precaution
> regarding the LeaderLatch recipe and connection loss scenarios? i.e. do w=
e
> have to close and re-create the LeaderLatches? Or can we be calm and just
> carry on with our business as Curator handles this?
>
> If anything is unclear, let me know.
>
> Best regards,
>
> Mathias S=C3=B6derberg
> Software Developer, Burt
>
> www.burtcorp.com
> Cell: + 46 762 79 57 55 | Skype: mthssdrbrg
> http://twitter.com/mthssdrbrg | http://twitter.com/burtcorp
> =E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=
=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=
=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=
=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93
>
> The Analytics Platform for Online Media
>
>


--=20

Mathias S=C3=B6derberg
Software Developer, Burt

www.burtcorp.com
Cell: + 46 762 79 57 55 | Skype: mthssdrbrg
http://twitter.com/mthssdrbrg | http://twitter.com/burtcorp
=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=
=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=
=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=
=93=E2=80=93

The Analytics Platform for Online Media

--001a11c1641e020af504fa674fee
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Right, that=E2=80=99s what I assumed after I actually read=
 the code for the LeaderLatch class.<div><br></div><div>We=E2=80=99re not u=
sing await(), but have a number of LeaderLatches and currently we=E2=80=99r=
e caching the last response of getLeader() (for a each LeaderLatch), and we=
 add watches for the election paths and update the cache if we get a NodeCh=
ildrenChanged notification.</div>

<div>When we get a LOST event followed by a RECONNECTED event we clear the =
cache and start over as we have no clue who=E2=80=99s responsible for what.=
 If we get a SUSPENDED event we don=E2=80=99t permit reads from the cache u=
ntil we get a RECONNECTED event (or rather we return null as we cannot be s=
ure who=E2=80=99s leader).</div>

<div><br></div><div>Perhaps we should clear the cache when we get a SUSPEND=
ED event as well, to be on the safe side.</div><div><br></div><div>But in c=
onclusion there=E2=80=99s no need to actually close and re-create LeaderLat=
ches in case of a connection loss, which is really what I was wondering abo=
ut.</div>

<div><br></div><div>Best regards,</div><div>Mathias</div></div><div class=
=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Tue, May 27, 2014 at=
 9:41 PM, Jordan Zimmerman <span dir=3D"ltr">&lt;<a href=3D"mailto:jordan@j=
ordanzimmerman.com" target=3D"_blank">jordan@jordanzimmerman.com</a>&gt;</s=
pan> wrote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div style=3D"word-wrap:break-word"><div sty=
le=3D"font-family:Helvetica,Arial;font-size:13px;color:rgba(0,0,0,1.0);marg=
in:0px;line-height:auto">

The documentation probably needs updating as this has been refined over tim=
e.</div><ul><li>The LeaderLatch installs its own connection state listener<=
/li><li>If the connection drops (SUSPENDED or LOST), the LeaderLatch change=
s its internal state to =E2=80=9Cleader =3D=3D false=E2=80=9D</li>

<li>If the connection goes to RECONNECTED, the LeaderLatch will attempt to =
regain leadership</li></ul> <div><div style=3D"font-family:helvetica,arial;=
font-size:13px">This has implications for users of LeaderLatch. If, for exa=
mple you&#39;ve called await() on the LeaderLatch your code will assume tha=
t it is the leader. However, if the connection drops you may no longer be t=
he leader. So, clients should install their own ConnectionStateListener and=
 notice that the connection has dropped. Also, you can examine LeaderLatch.=
hasLeadership() before your client code does anything where it assumes it i=
s the leader and then periodically re-check it.</div>

<div style=3D"font-family:helvetica,arial;font-size:13px"><br></div><div st=
yle=3D"font-family:helvetica,arial;font-size:13px">I hope this helps.</div>=
<div style=3D"font-family:helvetica,arial;font-size:13px"><br></div><div st=
yle=3D"font-family:helvetica,arial;font-size:13px">

-JZ</div><div style=3D"font-family:helvetica,arial;font-size:13px"><br></di=
v></div> <div style=3D"color:black"><br>From:=C2=A0<span style=3D"color:bla=
ck">Mathias S=C3=B6derberg</span> <a href=3D"mailto:mathias@burtcorp.com" t=
arget=3D"_blank">mathias@burtcorp.com</a><br>

Reply:=C2=A0<span style=3D"color:black"><a href=3D"mailto:user@curator.apac=
he.org" target=3D"_blank">user@curator.apache.org</a></span> <a href=3D"mai=
lto:user@curator.apache.org" target=3D"_blank">user@curator.apache.org</a><=
br>Date:=C2=A0<span style=3D"color:black">May 27, 2014 at 2:33:21 PM</span>=
<br>

To:=C2=A0<span style=3D"color:black"><a href=3D"mailto:user@curator.apache.=
org" target=3D"_blank">user@curator.apache.org</a></span> <a href=3D"mailto=
:user@curator.apache.org" target=3D"_blank">user@curator.apache.org</a><br>=
Subject:=C2=A0<span style=3D"color:black"> LeaderLatch recipe and error han=
dling <br>

</span></div><div><div class=3D"h5"><br> <blockquote type=3D"cite"><span><d=
iv><div></div><div>


<div dir=3D"ltr">Good evening,
<div><br></div>
<div>I=E2=80=99m currently working on a project where we=E2=80=99re utilisi=
ng
Curator and more specifically (quite heavily) the LeaderLatch
recipe.</div>
<div><br></div>
<div>The documentation for error handling in =E2=80=9Cgeneral=E2=80=9D stat=
es the
following for a LOST notification:</div>
<div><br></div>
<blockquote style=3D"margin:0px 0px 0px 40px;border:none;padding:0px">
<div>The connection is confirmed to be lost. Close any locks,
leaders, etc. and attempt to re-create them. NOTE: it is possible
to get a RECONNECTED state after this but you should still consider
any locks, etc. as dirty/unstable.</div>
<div><br></div>
</blockquote>
And the documentation for the LeaderLatch recipe states the
following:
<div><br></div>
<blockquote style=3D"margin:0 0 0 40px;border:none;padding:0px">
<div>LeaderLatch instances add a ConnectionStateListener to watch
for connection problems. If SUSPENDED or LOST is reported, the
LeaderLatch that is the leader will report that it is no longer the
leader (i.e. there will not be a leader until the connection is
re-established). If a LOST connection is RECONNECTED, the
LeaderLatch will delete its previous ZNode and create a new
one.</div>
<div><br></div>
<div>Users of LeaderLatch must take account that connection issues
can cause leadership to be lost. i.e. hasLeadership() returns true
but some time later the connection is SUSPENDED or LOST. At that
point hasLeadership() will return false. It is highly recommended
that LeaderLatch users register a ConnectionStateListener.</div>
<div><br></div>
</blockquote>
My conclusion from reading these two sections is that we=E2=80=99re
supposed to add a ConnectionStateListener and when we=E2=80=99re notified
of a LOST event followed by a RECONNECTED event, we=E2=80=99re supposed to
close the current LeaderLatches that we=E2=80=99re holding and re-create
them?
<div><br></div>
<div>However, looking through the actual code for the LeaderLatch,
it appears that this is actually already handled, i.e. it appears
to create a new znode when it encounters a RECONNECTED event, or am
I reading this wrong? (The documentation also states this as a
fact).</div>
<div><br></div>
<div>My question is really: do we have to take any particular
precaution regarding the LeaderLatch recipe and connection loss
scenarios? i.e. do we have to close and re-create the
LeaderLatches? Or can we be calm and just carry on with our
business as Curator handles this?</div>
<div><br></div>
<div>If anything is unclear, let me know.</div>
<div><br></div>
<div>Best regards,<br>
<div>
<div>
<div dir=3D"ltr">
<p><font>Mathias S=C3=B6derberg</font></p>
<font><font color=3D"#222222" style=3D"color:rgb(136,136,136)">Software
Developer, Burt</font><br>
<br>
<a href=3D"http://www.burtcorp.com/" style=3D"color:rgb(17,85,204)" target=
=3D"_blank">www.burtcorp.com</a><br>
<font color=3D"#222222" style=3D"color:rgb(136,136,136)">Cell:=C2=A0</font>=
<a value=3D"+46768973286" style=3D"color:rgb(17,85,204)">+ 46 762 79 57 55<=
/a><font color=3D"#222222" style=3D"color:rgb(136,136,136)">=C2=A0| Skype:
mthssdrbrg</font><br>
<a href=3D"http://twitter.com/mthssdrbrg" target=3D"_blank"><font color=3D"=
#1155CC">http://twitter.com/</font><font color=3D"#1155CC">m</font>thssdrbr=
g</a><font color=3D"#222222" style=3D"color:rgb(136,136,136)">=C2=A0|=C2=A0=
</font><a href=3D"http://twitter.com/burtcorp" style=3D"color:rgb(17,85,204=
)" target=3D"_blank">http://twitter.com/burtcorp</a><br>


<font color=3D"#888888">=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=
=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=
=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93</font=
><font color=3D"#888888">=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93</font=
></font>
<div style=3D"color:rgb(136,136,136)">
<div style=3D"margin:5px 0px"></div>
<div><br>
The Analytics Platform for Online Media</div>
</div>
</div>
</div>
</div>
</div>
</div>


</div></div></span></blockquote></div></div></div></blockquote></div><br><b=
r clear=3D"all"><div><br></div>-- <br><div dir=3D"ltr"><font><p>Mathias S=
=C3=B6derberg</p><font color=3D"#222222" style=3D"color:rgb(136,136,136)">S=
oftware Developer, Burt</font><br>

<br><a href=3D"http://www.burtcorp.com/" style=3D"color:rgb(17,85,204)" tar=
get=3D"_blank">www.burtcorp.com</a><br><font color=3D"#222222" style=3D"col=
or:rgb(136,136,136)">Cell:=C2=A0</font><a value=3D"+46768973286" style=3D"c=
olor:rgb(17,85,204)">+ 46 762 79 57 55</a><font color=3D"#222222" style=3D"=
color:rgb(136,136,136)">=C2=A0| Skype: mthssdrbrg</font><br>

<a href=3D"http://twitter.com/mthssdrbrg" target=3D"_blank"><font color=3D"=
#1155cc">http://twitter.com/</font><font color=3D"#1155cc">m</font>thssdrbr=
g</a><font color=3D"#222222" style=3D"color:rgb(136,136,136)">=C2=A0|=C2=A0=
</font><a href=3D"http://twitter.com/burtcorp" style=3D"color:rgb(17,85,204=
)" target=3D"_blank">http://twitter.com/burtcorp</a><br>

<font color=3D"#888888">=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=
=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=
=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93</font=
><font color=3D"#888888">=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=
=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93=E2=80=93</font=
></font><div style=3D"color:rgb(136,136,136)"><div style=3D"margin:5px 0px"=
></div><div><br>The Analytics Platform for Online Media</div>

</div></div>
</div>

--001a11c1641e020af504fa674fee--