Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_A32C39D5-D83F-4C02-A106-15DECF133074"
Message-Id: <7475747E-253A-4A04-B3E4-A163DC4425F8@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
Subject: Re: hinted handoff disabling trade-offs
Date: Tue, 19 Mar 2013 20:14:43 +1300
References: <cwokqftyjnjaeicogud7s9ry.1362460621015@email.android.com>
 <9B6E672A-20AB-4F02-AA7D-103A563AD1A8@barracuda.com>
 <AC3F2F06-CCF0-4F8F-BA5B-FBEB8367B717@thelastpickle.com>
 <CAGGEs2xLSq05c1Aa8ns9-M5n=E1aPCsr0-yLLV72z5Ep1AdTRw@mail.gmail.com>
To: user@cassandra.apache.org
In-Reply-To: 
 <CAGGEs2xLSq05c1Aa8ns9-M5n=E1aPCsr0-yLLV72z5Ep1AdTRw@mail.gmail.com>


--Apple-Mail=_A32C39D5-D83F-4C02-A106-15DECF133074
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1

>  I think I understand what it means for
> application-level data, but the part I'm not entirely sure about is
> what it could mean for Cassandra internals.
Internally it means the write will not be retried to nodes that were
either down or did not ack before rpc_timeout. That's all.

If you are running with read_repair_chance = 0 and CL ONE you are in a
very eventually consistent world. The only thing that will guarantee
consistency for you now is running nodetool repair.
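
To make that concrete, here is a toy sketch in Python. It is not Cassandra code or any Cassandra API: ToyCluster, demo and the three-dict "replicas" are made up for illustration, and timestamps and rpc_timeout are ignored. It only shows that a replica which missed a write while it was down stays stale at CL ONE without read repair, unless hints are replayed or a repair runs.

```python
# Toy model only: replicas are plain dicts, nothing here is real Cassandra behaviour.

class ToyCluster:
    def __init__(self, replicas=3, hinted_handoff=True):
        self.nodes = [{} for _ in range(replicas)]
        self.up = [True] * replicas
        self.hinted_handoff = hinted_handoff
        self.hints = []  # (target_node_index, key, value) held by the coordinator

    def write(self, key, value):
        """Write at CL ONE: succeeds if at least one replica acks."""
        acks = 0
        for i, node in enumerate(self.nodes):
            if self.up[i]:
                node[key] = value
                acks += 1
            elif self.hinted_handoff:
                # Replica is down: remember the write so it can be replayed later.
                self.hints.append((i, key, value))
        return acks >= 1

    def bring_up(self, i):
        """Node i comes back; replay any hints stored for it."""
        self.up[i] = True
        remaining = []
        for target, key, value in self.hints:
            if target == i:
                self.nodes[i][key] = value
            else:
                remaining.append((target, key, value))
        self.hints = remaining

    def read_one(self, i, key):
        """Read at CL ONE from replica i, with no read repair."""
        return self.nodes[i].get(key)

    def repair(self, key):
        """Stand-in for nodetool repair: copy a present value to every replica."""
        values = [n[key] for n in self.nodes if key in n]
        if values:
            for n in self.nodes:
                n[key] = values[0]


def demo(hinted_handoff):
    """Return what replica 2 holds before and after repair."""
    c = ToyCluster(hinted_handoff=hinted_handoff)
    c.up[2] = False          # replica 2 is down during the write
    c.write("k", "v1")
    c.bring_up(2)
    before = c.read_one(2, "k")
    c.repair("k")
    return before, c.read_one(2, "k")
```

In this model demo(True) returns ("v1", "v1"), while demo(False) returns (None, "v1"): with hints disabled the stale replica only catches up once the repair stand-in runs.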

> My cluster is under heavy write load. I'm considering disabling Hinted
> Handoffs so the nodes recover quicker in case compactions begin to
> back up.
If the cluster is approaching capacity, then ultimately the thing to do
is add more nodes. The only other things to do are disable the commit
log and use a lower CL.

If it's approaching capacity you will start to see pending mutations
back up, maybe some dropped mutations, and maybe an increase in the
difference between the latency reported in proxyhistograms and the
latency reported in cfhistograms or cfstats.
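
As a rough sketch of watching that difference, the Python below compares a request latency sample (what proxyhistograms reports) with a local latency sample (what cfhistograms reports). The function names, the factor of 2 and the idea of collecting samples into lists are all assumptions for illustration; nothing here parses real nodetool output, and the units are simply assumed to match.

```python
from statistics import median

def latency_gap(proxy_samples, local_samples):
    """Median request latency minus median local latency, in the samples' units."""
    return median(proxy_samples) - median(local_samples)

def gap_is_growing(gaps, factor=2.0):
    """True if the newest gap is at least `factor` times the median of the earlier gaps.

    `factor` is an arbitrary illustrative threshold, not a recommended value.
    """
    if len(gaps) < 2:
        return False
    baseline = median(gaps[:-1])
    return baseline > 0 and gaps[-1] >= factor * baseline
```

For example, with made-up numbers, latency_gap([900, 1000, 1100], [200, 300, 400]) gives 700, and a history of gaps such as [700, 720, 690, 2100] would be flagged as growing.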

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 16/03/2013, at 4:50 PM, Matt Kap <matvey1414@gmail.com> wrote:

> Thanks Aaron.
>
> I am using CL=ONE. read_repair_chance=0. The part which I'm wondering
> about is what happens to the internal Cassandra writes if Hinted
> Handoffs are disabled. I think I understand what it means for
> application-level data, but the part I'm not entirely sure about is
> what it could mean for Cassandra internals.
>
> My cluster is under heavy write load. I'm considering disabling Hinted
> Handoffs so the nodes recover quicker in case compactions begin to
> back up.
>
> On Wed, Mar 6, 2013 at 2:06 AM, aaron morton <aaron@thelastpickle.com> wrote:
>> The advantage of HH is that it reduces the probability of a DigestMismatch
>> when using a CL > ONE. A DigestMismatch means the read has to run a second
>> time before returning to the client.
>>
>>> - No risk of hinted-handoffs building up
>>> - No risk of hinted-handoffs flooding a node that just came up
>>
>> See the yaml config settings for the max hint window and the throttling.
>>
>>> Can anyone suggest any other factors that I'm missing here? Specifically,
>>> reasons not to do this.
>>
>> If you are doing this for performance, first make sure your data model is
>> efficient, that you are doing the most efficient reads (see my presentation
>> here http://www.datastax.com/events/cassandrasummit2012/presentations), and
>> your caching is bang on. Then consider if you can tune the CL, and if your
>> client is token aware so it directs traffic to a node that has the data.
>>
>> Cheers
>>
>> -----------------
>> Aaron Morton
>> Freelance Cassandra Developer
>> New Zealand
>>
>> @aaronmorton
>> http://www.thelastpickle.com
>>
>> On 4/03/2013, at 9:19 PM, Michael Kjellman <mkjellman@barracuda.com> wrote:
>>
>> Also, if you have enough hints being created that it's significantly
>> impacting your heap I have a feeling things are going to get out of sync
>> very quickly.
>>
>> On Mar 4, 2013, at 9:17 PM, "Wz1975" <wz1975@YAHOO.COM> wrote:
>>
>> Why do you think disabling hinted handoff will improve memory usage?
>>
>> Thanks.
>> -Wei
>>
>> Sent from my Samsung smartphone on AT&T
>>
>> -------- Original message --------
>> Subject: Re: hinted handoff disabling trade-offs
>> From: Michael Kjellman <mkjellman@barracuda.com>
>> To: "user@cassandra.apache.org" <user@cassandra.apache.org>
>> CC:
>>
>> Repair is slow.
>>
>> On Mar 4, 2013, at 8:07 PM, "Matt Kap" <matvey1414@gmail.com> wrote:
>>
>>> I am looking to get a second opinion about disabling hinted-handoffs. I
>>> have an application that can tolerate a fair amount of inconsistency
>>> (advertising domain), and so I'm weighing the pros and cons of hinted
>>> handoffs. I'm running Cassandra 1.0, looking to upgrade to 1.1 soon.
>>>
>>> Pros of disabling hinted handoffs:
>>> - Reduces heap
>>> - Improves GC performance
>>> - No risk of hinted-handoffs building up
>>> - No risk of hinted-handoffs flooding a node that just came up
>>>
>>> Cons
>>> - Some writes can be lost, at least until repair runs
>>>
>>> Can anyone suggest any other factors that I'm missing here? Specifically,
>>> reasons not to do this.
>>>
>>> Cheers!
>>> -Matt
>>
>> Copy, by Barracuda, helps you store, protect, and share all your amazing
>> things. Start today: www.copy.com.
>>
>
> --
> www.calcmachine.com - easy online calculator.



--Apple-Mail=_A32C39D5-D83F-4C02-A106-15DECF133074--