From: "Jack Krupansky" <jack@basetechnology.com>
To: user@cassandra.apache.org
Subject: Re: Hot, large row
Date: Thu, 24 Jul 2014 16:07:41 -0400

Could it be some “fat columns” (cells with large blob or text values) rather than the cell count per se? IOW, a “big row” rather than a “wide row”?
 
And, could it be a large partition rather than a large row (many rows in a single partition)? Are clustering columns being used in the primary key?
 
-- Jack Krupansky
 
From: DuyHai Doan
Sent: Thursday, July 24, 2014 3:53 PM
To: user@cassandra.apache.org
Subject: Re: Hot, large row
 
Your extract of cfhistograms shows that there are no particular "wide rows". The widest has 61214 cells, which is big but not so huge as to be a real concern.

Turning on trace probability only gives you some "hints" about what kind of queries are being run; it does not give the exact partition key or other statement values, especially when you are using prepared statements...
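For what it's worth, once tracing is on you can at least pull the slowest traced sessions and look at their parameters. A rough sketch, assuming the DataStax Java driver 2.x (the contact point, LIMIT, and one-second cutoff are placeholders, not anything prescribed by Cassandra):

// Rough sketch: list the slowest traced sessions after enabling tracing with
// "nodetool settraceprobability". Assumes the DataStax Java driver 2.x;
// the contact point, LIMIT, and 1-second cutoff are placeholders.
import com.datastax.driver.core.Cluster;
import com.datastax.driver.core.Row;
import com.datastax.driver.core.Session;

public class SlowTraceDigger {
    public static void main(String[] args) {
        Cluster cluster = Cluster.builder().addContactPoint("127.0.0.1").build();
        try {
            Session session = cluster.connect();
            // system_traces.sessions stores duration in microseconds
            for (Row r : session.execute(
                    "SELECT session_id, duration, parameters FROM system_traces.sessions LIMIT 500")) {
                if (r.getInt("duration") > 1000000) { // slower than 1 second
                    System.out.println(r.getUUID("session_id") + " took "
                            + r.getInt("duration") / 1000 + " ms, parameters="
                            + r.getMap("parameters", String.class, String.class));
                }
            }
        } finally {
            cluster.close();
        }
    }
}

Per-step details for a given session_id live in system_traces.events, but as said above, prepared statement values won't be spelled out for you.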


"I am considering reducing read_request_timeout_in_ms: 5000 in cassandra.yaml so that it reduces the impact when this occurs." --> Don't do that, you'll only sweep the dust under the carpet. Find the real issue and fix it instead of changing a parameter to hide it.

One solution would be, on the client side, to activate some logging to show the CQL3 statements the application is issuing that may be overloading the server. I know that's easier said than done, but I don't have any other idea for the moment.
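As an illustration only, a minimal sketch of such logging around the DataStax Java driver (the "LoggingSession" wrapper and the 500 ms "slow" threshold are invented for the example, not an existing API):

// A minimal sketch, assuming the DataStax Java driver 2.x.
// "LoggingSession" and the 500 ms threshold are invented for illustration.
import com.datastax.driver.core.BoundStatement;
import com.datastax.driver.core.ResultSet;
import com.datastax.driver.core.Session;
import com.datastax.driver.core.Statement;

public class LoggingSession {
    private final Session session;

    public LoggingSession(Session session) {
        this.session = session;
    }

    public ResultSet execute(Statement stmt) {
        long start = System.nanoTime();
        ResultSet rs = session.execute(stmt);
        long elapsedMs = (System.nanoTime() - start) / 1000000L;
        if (elapsedMs > 500) {
            // For prepared statements we can at least log the query string;
            // the bound values are the part that stays hidden.
            String cql = (stmt instanceof BoundStatement)
                    ? ((BoundStatement) stmt).preparedStatement().getQueryString()
                    : stmt.toString(); // toString() may not show the full CQL for every type
            System.err.println("Slow statement (" + elapsedMs + " ms): " + cql);
        }
        return rs;
    }
}

Even without the bound values, the query string plus timing can be enough to narrow down which access pattern keeps hitting the same large partition.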

-------- Shameless self-promotion ------

To support this kind of live prod debugging & investigation, I added a new dynamic query logging feature in Achilles: https://github.com/doanduyhai/Achilles/wiki/Statements-Logging-and-Tracing#dynamic-statements-logging
Once you hit the issue, this kind of feature may save your day...
 
 




On Thu, Jul 24, 2014 at 9:22 PM, Keith Wright <kwright@nanigans.com> wrote:
I can see from cfhistograms that I do have some wide rows (see below). I set trace probability as you suggested, but the output doesn’t appear to tell me what row was actually read, unless I missed something; I just see "executing prepared statement". Any ideas how I can find the row in question?
 
I am considering reducing read_request_timeout_in_ms: 5000 in cassandra.yaml so that it reduces the impact when this occurs.
 
Any help in identifying my issue would be GREATLY appreciated
 

Cell Count per Partition
    1 cells: 50449950
    2 cells: 14281828
    3 cells: 8093366
    4 cells: 5029200
    5 cells: 3103023
    6 cells: 3059903
    7 cells: 1903018
    8 cells: 1509297
   10 cells: 2420359
   12 cells: 1624895
   14 cells: 1171678
   17 cells: 1289391
   20 cells: 909777
   24 cells: 852081
   29 cells: 722925
   35 cells: 587067
   42 cells: 459473
   50 cells: 358744
   60 cells: 304146
   72 cells: 244682
   86 cells: 191045
  103 cells: 155337
  124 cells: 127061
  149 cells: 98913
  179 cells: 77454
  215 cells: 59849
  258 cells: 46117
  310 cells: 35321
  372 cells: 26319
  446 cells: 19379
  535 cells: 13783
  642 cells: 9993
  770 cells: 6973
  924 cells: 4713
 1109 cells: 3229
 1331 cells: 2062
 1597 cells: 1338
 1916 cells: 773
 2299 cells: 495
 2759 cells: 268
 3311 cells: 150
 3973 cells: 100
 4768 cells: 42
 5722 cells: 24
 6866 cells: 12
 8239 cells: 9
 9887 cells: 3
11864 cells: 0
14237 cells: 5
17084 cells: 1
20501 cells: 0
24601 cells: 2
29521 cells: 0
35425 cells: 0
42510 cells: 0
51012 cells: 0
61214 cells: 2

 
From: DuyHai Doan <doanduyhai@gmail.com>
Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Date: Thursday, July 24, 2014 at 3:01 PM
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Subject: Re: Hot, large row
 
"How can I detect wide rows?" -->

nodetool cfhistograms <keyspace> <suspected column family>

Look at the "Column count" column (the last one) and identify lines in this column with a very high "Offset" value. In a well-designed application you should have a Gaussian distribution where 80% of your rows have a similar number of columns.
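If it helps, here is a small throwaway sketch (not part of nodetool; the 10000-cell cutoff is arbitrary) that scans histogram lines in the "N cells: M" form pasted above and prints the buckets past the cutoff:

// Throwaway sketch: pipe the "Cell Count per Partition" section of
// `nodetool cfhistograms` output into stdin and print the buckets whose
// offset exceeds an arbitrary cutoff (10000 cells here).
import java.io.BufferedReader;
import java.io.InputStreamReader;

public class WidePartitionFlagger {
    public static void main(String[] args) throws Exception {
        long cutoff = 10000L;
        BufferedReader in = new BufferedReader(new InputStreamReader(System.in));
        String line;
        while ((line = in.readLine()) != null) {
            String trimmed = line.trim();
            // Only histogram bucket lines, e.g. "61214 cells: 2"
            if (!trimmed.matches("\\d+\\s+cells:\\s+\\d+")) {
                continue;
            }
            String[] parts = trimmed.split("\\s+");
            long offset = Long.parseLong(parts[0]);      // cells per partition (bucket)
            long partitions = Long.parseLong(parts[2]);  // partitions in this bucket
            if (offset >= cutoff && partitions > 0) {
                System.out.println(partitions + " partition(s) with ~" + offset + " cells");
            }
        }
    }
}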

"Anyone know what debug level I can set so that I can see what reads the hot node is handling?" -->

"nodetool settraceprobability <value>", where value is a small number (0.001), on the node where you encounter the issue. Activate the tracing for a while (5 minutes) and deactivate it (value = 0). Then look into the system_traces tables "events" & "sessions". It may help or not, since at that probability only about one request in every 1000 is traced.

"Any way to get the server to blacklist these wide rows automatically?" --> No


On Thu, Jul 24, 2014 at 8:48 PM, Keith Wright <kwright@nanigans.com> wrote:
Hi all,
 
   We are seeing an issue where, basically daily, one of our nodes spikes in load and churns under CMS heap pressure. It appears that reads are backing up, and my guess is that our application is reading a large row repeatedly. Our write structure can lend itself to wide rows very infrequently (<0.001%) and we do our best to detect and delete them, but obviously we’re missing a case. Hoping for assistance on the following questions:
  • How can I detect wide rows?
  • Anyone know what debug level I can set so that I can see what reads the hot node is handling? I’m hoping to see the “bad” row.
  • Any way to get the server to blacklist these wide rows automatically?
We’re using C* 2.0.6 with vnodes.
 
Thanks
 
 