Subject: Re: Pelops - a new Java client library paradigm
From: Dominic Williams <thedwilliams@googlemail.com>
To: user@cassandra.apache.org
Date: Mon, 14 Jun 2010 16:15:55 +0100

Hi, re: pools and detecting node failure...

Pooling is handled by ThriftPool. This class maintains a separate NodeContext object for each known node, which in turn maintains a pool of connections to its node.

Each NodeContext has a single "poolRefiller" object/thread, which runs either when signalled or every ~2s, whichever comes sooner. Whenever it runs, the first thing it does is check which of its existing pooled connections are open. This is necessary for it to correctly calculate the number of new connections to open (assuming it has to open any).

To check whether a connection is open, it calls TTransport.isOpen, which delegates to TSocket.isOpen, which delegates to Socket.isConnected. If a connection is not open, it is binned.

Therefore, if a node has failed, the NodeContext will pretty quickly be holding no connections to it, and NodeContext.isAvailable will return false.
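The refill pass described above can be sketched as follows. This is a simplified, self-contained model, not Pelops' actual code: `Connection`, `TARGET_POOL_SIZE`, `refill` and `isAvailable` are illustrative names standing in for the NodeContext internals, and `isOpen()` stands in for the TTransport.isOpen check.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

public class RefillSketch {
    // Stand-in for a pooled Thrift connection; isOpen() models
    // TTransport.isOpen -> TSocket.isOpen -> Socket.isConnected.
    static class Connection {
        final boolean open;
        Connection(boolean open) { this.open = open; }
        boolean isOpen() { return open; }
    }

    static final int TARGET_POOL_SIZE = 4; // illustrative policy value

    // First bin every closed connection, then report how many new
    // connections the refiller should try to open to reach the target.
    static int refill(List<Connection> pool) {
        for (Iterator<Connection> it = pool.iterator(); it.hasNext(); ) {
            if (!it.next().isOpen()) it.remove(); // "binned"
        }
        return Math.max(0, TARGET_POOL_SIZE - pool.size());
    }

    // A node with no live pooled connections is treated as unavailable.
    static boolean isAvailable(List<Connection> pool) {
        return !pool.isEmpty();
    }

    public static void main(String[] args) {
        List<Connection> pool = new ArrayList<>();
        pool.add(new Connection(true));
        pool.add(new Connection(false)); // the node dropped this one
        int toOpen = refill(pool);
        System.out.println("live=" + pool.size() + " toOpen=" + toOpen
                + " available=" + isAvailable(pool));
        // prints: live=1 toOpen=3 available=true
    }
}
```

The point of binning first is exactly the one above: the count of connections still open must be known before the refiller can decide how many to create.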
When NodeContext.isAvailable returns false for a node, that node is not considered by ThriftPool when it is seeking to return a connection to an operand (a Mutator, Selector, KeyDeletor etc. object).

The pool refiller thread keeps trying to create connections to a node even after all connections to it have failed. When/if the node becomes available again, then as soon as a connection is made NodeContext.isAvailable will return true and the node comes "back online" for the purposes of the operands.

NOTE: Some of my colleagues were working on Windows machines separated from our local development servers by low-end NAT routers. After some period of use, even though TSocket.isOpen was returning true inside Pelops, operands trying to use those connections were getting timeouts or other network exceptions. Calling setKeepAlive(true) on the underlying socket does not prevent this (although the option is best set anyway, because in general it forces timely detection of connection failure). Hector experienced similar problems, and we adopt a similar response: by default Pelops sets the Policy.getKillNodeConnsOnException() option to true. This means that if a network exception is thrown while an operand is interacting with a node, the NodeContext destroys all pooled connections to that node, on the basis that a general failure of connections to that node may not be detectable because of the network setup. Of course, not many people will run their Cassandra clients from Windows behind NAT in production, but the option is on by default because otherwise a segment of developers trying the library would hit persistent problems due to this network (and/or Thrift) strangeness. In production we will ourselves switch it off (although note that the worst downside of leaving it on is that an occasional network error to a node causes its pool to be refreshed).

Hope this makes sense.
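The kill-connections-on-exception behaviour in the NOTE can be modelled roughly like this. Again a self-contained toy, not Pelops' internals: the class and method names are illustrative, and the boolean constructor flag stands in for the Policy.getKillNodeConnsOnException() setting.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.Map;

public class KillOnExceptionSketch {
    final Map<String, Deque<String>> poolsByNode = new HashMap<>();
    final boolean killNodeConnsOnException; // models the policy flag

    KillOnExceptionSketch(boolean killNodeConnsOnException) {
        this.killNodeConnsOnException = killNodeConnsOnException;
    }

    void pool(String node, String conn) {
        poolsByNode.computeIfAbsent(node, n -> new ArrayDeque<>()).add(conn);
    }

    // Called when an operand hits a timeout or other network exception
    // talking to `node`. With the flag on, every pooled connection to
    // that node is destroyed, since its siblings may be just as dead
    // without isOpen() noticing (the NAT case described above).
    void onNetworkException(String node) {
        if (killNodeConnsOnException) {
            Deque<String> pool = poolsByNode.get(node);
            if (pool != null) pool.clear();
        }
    }

    int pooled(String node) {
        Deque<String> pool = poolsByNode.get(node);
        return pool == null ? 0 : pool.size();
    }

    public static void main(String[] args) {
        KillOnExceptionSketch ctx = new KillOnExceptionSketch(true);
        ctx.pool("10.0.0.1", "c1");
        ctx.pool("10.0.0.1", "c2");
        ctx.onNetworkException("10.0.0.1"); // one error purges the pool
        System.out.println(ctx.pooled("10.0.0.1")); // prints 0
    }
}
```

The trade-off mentioned above falls straight out of this model: with the flag on, a single transient error throws away every healthy connection to the node, which the refiller then has to recreate.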
Best, Dominic

On 14 June 2010 15:32, Kochheiser, Todd W - TO-DITT1 <twkochheiser@bpa.gov> wrote:

> Great API that looks easy and intuitive to use. Regarding your
> connection pool implementation, how does it handle failed/crashed nodes?
> Will the pool auto-detect failed nodes via a "tester" thread, or will a
> failed node, and hence its pooled connection(s), be removed only when they
> are used? Conversely, how will the pool be repopulated once the
> failed/crashed node becomes available?
>
> Todd
>
> ------------------------------
>
> *From:* Dominic Williams [mailto:thedwilliams@googlemail.com]
> *Sent:* Friday, June 11, 2010 7:05 AM
> *To:* user@cassandra.apache.org
> *Subject:* Re: Pelops - a new Java client library paradigm
>
> Hi, good question.
>
> The scalability of Pelops is dependent on Cassandra, not the library
> itself. The library aims to provide a more effective access layer on top of
> the Thrift API.
>
> The library does perform connection pooling, and you can control the size
> of the pool and other parameters using a policy object. But connection
> pooling itself does not increase scalability, only efficiency.
>
> Hope this helps.
>
> Best, Dominic
>
> On 11 June 2010 14:47, Ian Soboroff <isoboroff@gmail.com> wrote:
>
> Sounds nice. Can you say something about the scales at which you've used
> this library? Both write and read load? Size of clusters and size of data?
>
> Ian
>
> On Fri, Jun 11, 2010 at 9:41 AM, Dominic Williams <
> thedwilliams@googlemail.com> wrote:
>
> Pelops is a new high-quality Java client library for Cassandra.
>
> It has a design that:
>
> * reveals the full power of Cassandra through an elegant "Mutator and
> Selector" paradigm
>
> * generates better, cleaner, less bug-prone code
>
> * reduces the learning curve for new users
>
> * drives rapid application development
>
> * encapsulates advanced pooling algorithms
>
> An article introducing Pelops can be found at
> http://ria101.wordpress.com/2010/06/11/pelops-the-beautiful-cassandra-database-client-for-java/
>
> Thanks for reading.
>
> Best, Dominic
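The "Mutator and Selector" paradigm mentioned in the quoted announcement can be illustrated with a toy model. To be clear, this is NOT the real Pelops API: the classes below are a self-contained stand-in against an in-memory map, showing only the shape of the idea (a Mutator batches writes and applies them in one execute() call; a Selector reads).

```java
import java.util.HashMap;
import java.util.Map;

public class MutatorSelectorSketch {
    // Stand-in for the Cassandra cluster: a single in-memory "row store".
    static final Map<String, String> store = new HashMap<>();

    // Batches up writes; nothing is visible until execute() is called.
    static class Mutator {
        private final Map<String, String> pending = new HashMap<>();
        Mutator writeColumn(String key, String value) {
            pending.put(key, value); // buffered only
            return this;             // allows chaining
        }
        void execute() { store.putAll(pending); } // apply as one batch
    }

    // Read-side counterpart of the Mutator.
    static class Selector {
        String getColumn(String key) { return store.get(key); }
    }

    public static void main(String[] args) {
        new Mutator().writeColumn("user:1", "dominic").execute();
        System.out.println(new Selector().getColumn("user:1")); // prints dominic
    }
}
```

Splitting mutation and selection into two small, single-purpose objects is what the announcement's "less bug-prone code" and "reduced learning curve" claims rest on.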