From: Jonathan Haddad <jon@jonhaddad.com>
Date: Tue, 05 Apr 2016 21:44:41 +0000
Subject: Re: Is it possible to achieve "sticky" request routing?
To: user@cassandra.apache.org

Jim, that's not what he asked. He asked for the equivalent of a load
balancer with sticky sessions.

On Tue, Apr 5, 2016 at 2:24 PM Jim Ancona <jim@anconafamily.com> wrote:

> Jon and Steve:
>
> I don't understand your point. The TokenAwareLoadBalancer identifies the
> nodes in the cluster that own the data for a particular token and routes
> requests to one of them. As I understand it, the OP wants to send requests
> for a particular token to the same node every time (assuming it's
> available). How does that fail in a large cluster?
>
> Jim
>
> On Tue, Apr 5, 2016 at 4:31 PM, Jonathan Haddad <jon@jonhaddad.com> wrote:
>
>> Yep - Steve hit the nail on the head. The odds of hitting the right
>> server with "sticky routing" go down as your cluster size increases. You
>> end up adding extra network hops instead of using token-aware routing.
>>
>> Unless you're trying to build a coordinator tier (and you're not,
>> according to your original post), this is a pretty bad idea and I'd
>> advise you to push back on that requirement.
>>
>> On Tue, Apr 5, 2016 at 12:47 PM Steve Robenalt <srobenalt@highwire.org>
>> wrote:
>>
>>> Aside from Jon's "why" question, I would point out that this only
>>> really works because you are running a 3-node cluster with RF=3. If your
>>> cluster is going to grow, you can't guarantee that any one server would
>>> have all records. I'd be pretty hesitant to put an invisible constraint
>>> like that on a cluster unless you're pretty sure it'll only ever be 3
>>> nodes.
>>>
>>> On Tue, Apr 5, 2016 at 9:34 AM, Jonathan Haddad <jon@jonhaddad.com>
>>> wrote:
>>>
>>>> Why is this a requirement? Honestly I don't know why you would do this.
>>>>
>>>> On Sat, Apr 2, 2016 at 8:06 PM Mukil Kesavan <weirdbluelights@gmail.com>
>>>> wrote:
>>>>
>>>>> Hello,
>>>>>
>>>>> We currently have 3 Cassandra servers running in a single datacenter
>>>>> with a replication factor of 3 for our keyspace. We also use the
>>>>> SimpleSnitch with DynamicSnitching enabled by default. Our load
>>>>> balancing policy is TokenAwareLoadBalancingPolicy with RoundRobinPolicy
>>>>> as the child. This overall configuration results in our client requests
>>>>> spreading equally across our 3 servers.
>>>>>
>>>>> However, we have a new requirement where we need to restrict a
>>>>> client's requests to a single server, and only go to the other servers
>>>>> on failure of that server. This particular use case does not have high
>>>>> request traffic.
>>>>>
>>>>> Looking at the documentation, the options we have seem to be:
>>>>>
>>>>> 1. Play with the snitching (e.g. place each server into its own DC or
>>>>> rack) to ensure that requests always go to one server and fail over to
>>>>> the others if required. I understand that this may also affect replica
>>>>> placement and we may need to run nodetool repair. So this is not our
>>>>> most preferred option.
>>>>>
>>>>> 2. Write a new load balancing policy that also uses the
>>>>> HostStateListener for tracking host up and down messages, and that
>>>>> essentially accomplishes "sticky" request routing with failover to
>>>>> other nodes.
>>>>>
>>>>> Is option 2 the only clean way of accomplishing our requirement?
>>>>>
>>>>> Thanks,
>>>>> Micky
>>>>
>>>
>>> --
>>> Steve Robenalt
>>> Software Architect
>>> srobenalt@highwire.org
>>> (office/cell): 916-505-1785
>>>
>>> HighWire Press, Inc.
>>> 425 Broadway St, Redwood City, CA 94063
>>> www.highwire.org
>>>
>>> Technology for Scholarly Communication
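
For the archives: option 2 maps onto the DataStax Java driver's
LoadBalancingPolicy interface. Below is a minimal, untested sketch,
assuming driver 3.x. The class name StickyPolicy and the sort-by-address
preference order are illustrative assumptions, not an existing driver
class; the driver invokes the onUp/onDown callbacks itself, which covers
the host state tracking mentioned above.

import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

import com.datastax.driver.core.Cluster;
import com.datastax.driver.core.Host;
import com.datastax.driver.core.HostDistance;
import com.datastax.driver.core.Statement;
import com.datastax.driver.core.policies.LoadBalancingPolicy;

// Hypothetical "sticky" policy: every query plan starts with the same
// preferred live node; the remaining nodes are tried only on failover.
public class StickyPolicy implements LoadBalancingPolicy {

    // Deterministic ordering so every client prefers the same node.
    private static final Comparator<Host> BY_ADDRESS = new Comparator<Host>() {
        public int compare(Host a, Host b) {
            return a.getAddress().getHostAddress()
                    .compareTo(b.getAddress().getHostAddress());
        }
    };

    private final CopyOnWriteArrayList<Host> liveHosts =
            new CopyOnWriteArrayList<Host>();

    @Override
    public void init(Cluster cluster, Collection<Host> hosts) {
        liveHosts.addAll(hosts);
    }

    @Override
    public HostDistance distance(Host host) {
        return HostDistance.LOCAL; // single DC, per the original post
    }

    @Override
    public Iterator<Host> newQueryPlan(String loggedKeyspace, Statement statement) {
        // Snapshot and sort the live hosts: the head of the list is the
        // sticky target, the tail is the failover order.
        List<Host> plan = new ArrayList<Host>(liveHosts);
        Collections.sort(plan, BY_ADDRESS);
        return plan.iterator();
    }

    // State callbacks invoked by the driver as hosts come and go.
    @Override public void onUp(Host host)     { liveHosts.addIfAbsent(host); }
    @Override public void onDown(Host host)   { liveHosts.remove(host); }
    @Override public void onAdd(Host host)    { onUp(host); }
    @Override public void onRemove(Host host) { liveHosts.remove(host); }

    @Override public void close() { }
}

Wiring it in would replace the token-aware chain from the original post,
e.g. (contact point is a placeholder):

Cluster cluster = Cluster.builder()
        .addContactPoint("127.0.0.1")
        // previously: new TokenAwarePolicy(new RoundRobinPolicy())
        .withLoadBalancingPolicy(new StickyPolicy())
        .build();

The caveats above still apply: unless the preferred node happens to own
the requested token, every query pays an extra coordinator hop, and once
the cluster grows beyond RF nodes the preferred node no longer holds all
the data.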