Subject: Re: persistent compaction issue (1.1.4 and 1.1.5)
From: Vitalii Tymchyshyn <tivv00@gmail.com>
To: user@cassandra.apache.org
Date: Thu, 20 Sep 2012 08:35:09 +0300

I did see problems with schema agreement on 1.1.4, but they went away after a
rolling restart (BTW: it would still be good to check the schema versions
reported for the unreachable node). The same rolling restart also helped force
compactions after moving to Leveled compaction. If your compactions still
don't make progress, you can try removing the *.json manifest files from the
data directory of the stopped node to force all SSTables back to level 0.
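Something like this, per node, as a rough sketch only (the data-directory
layout and the service commands below are assumptions for a default 1.1
install; substitute your own keyspace/column family paths):

    nodetool drain                 # flush memtables before stopping the node
    service cassandra stop         # or however you normally stop the node
    # The leveled-compaction manifest is the <cf>.json file sitting in the
    # column family's data directory; with the node down, removing it makes
    # Cassandra treat every SSTable as level 0 on restart.
    rm /var/lib/cassandra/data/<keyspace>/<cf>/<cf>.json
    service cassandra start

After the restart the node should start compacting the level-0 SSTables back
up through the levels.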
Best regards,
Vitalii Tymchyshyn

2012/9/19 Michael Kjellman <mkjellman@barracuda.com>:
> Potentially the pending compactions are a symptom and not the root
> cause/problem.
>
> When updating a 3rd column family with a larger sstable_size_in_mb, it
> looks like the schema may not be in a good state:
>
> [default@xxxx] UPDATE COLUMN FAMILY screenshots WITH
>   compaction_strategy=LeveledCompactionStrategy AND
>   compaction_strategy_options={sstable_size_in_mb: 200};
> 290cf619-57b0-3ad1-9ae3-e313290de9c9
> Waiting for schema agreement...
> Warning: unreachable nodes 10.8.30.102. The schema has not settled in 10
> seconds; further migrations are ill-advised until it does.
> Versions are UNREACHABLE:[10.8.30.102],
> 290cf619-57b0-3ad1-9ae3-e313290de9c9:[10.8.30.15, 10.8.30.14, 10.8.30.13,
> 10.8.30.103, 10.8.30.104, 10.8.30.105, 10.8.30.106],
> f1de54f5-8830-31a6-9cdd-aaa6220cccd1:[10.8.30.101]
>
> However, tpstats looks good, and the schema changes do eventually get
> applied on *all* the nodes (even the ones that seem to have a different
> schema version). There are no communication issues between the nodes, and
> they are all in the same rack.
>
> root@xxxx:~# nodetool tpstats
> Pool Name               Active  Pending  Completed  Blocked  All time blocked
> ReadStage                    0        0    1254592        0                 0
> RequestResponseStage         0        0    9480827        0                 0
> MutationStage                0        0    8662263        0                 0
> ReadRepairStage              0        0     339158        0                 0
> ReplicateOnWriteStage        0        0          0        0                 0
> GossipStage                  0        0    1469197        0                 0
> AntiEntropyStage             0        0          0        0                 0
> MigrationStage               0        0       1808        0                 0
> MemtablePostFlusher          0        0        248        0                 0
> StreamStage                  0        0          0        0                 0
> FlushWriter                  0        0        248        0                 4
> MiscStage                    0        0          0        0                 0
> commitlog_archiver           0        0          0        0                 0
> InternalResponseStage        0        0       5286        0                 0
> HintedHandoff                0        0         21        0                 0
>
> Message type       Dropped
> RANGE_SLICE              0
> READ_REPAIR              0
> BINARY                   0
> READ                     0
> MUTATION                 0
> REQUEST_RESPONSE         0
>
> So I'm guessing the different schema versions may be what is stopping
> compactions? Will compactions still happen if there are different versions
> of the schema?
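A quick way to watch the schema agreement from any live node is the cluster
description in cassandra-cli (a sketch; the exact output wording varies a bit
between versions):

    describe cluster;

It lists each schema version UUID together with the nodes that hold it, plus
any UNREACHABLE nodes, i.e. the same grouping shown in the warning above. As
long as it reports more than one version, it is safer to hold off on further
schema changes, as the warning itself suggests.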
> On 9/18/12 11:38 AM, "Michael Kjellman" <mkjellman@barracuda.com> wrote:
>
> >Thanks, I just modified the schema on the worst-offending column family
> >(as determined by the .json manifest) from 10MB to 200MB.
> >
> >Should I kick off a compaction on this cf now? A repair? A scrub?
> >
> >Thanks
> >
> >-michael
> >
> >From: Vitalii Tymchyshyn <tivv00@gmail.com>
> >Reply-To: user@cassandra.apache.org
> >To: user@cassandra.apache.org
> >Subject: Re: persistent compaction issue (1.1.4 and 1.1.5)
> >
> >I started using LeveledCompaction some time ago, and in my experience
> >these numbers indicate SSTables sitting at lower levels than they should
> >be. Compaction is running and moving them up level by level, but the
> >total count does not change because new data keeps coming in.
> >The numbers look very high to me. They mean a lot of files (over 100K in
> >a single directory) and a lot of work for the compaction executor just to
> >decide what to compact next. I would consider even 5K-10K a high number.
> >If I were you, I'd increase sstable_size_in_mb to 10-20 times its current
> >value.
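For a rough sense of how bad the file count and level distribution are on a
node, something like the following can help (a sketch only; the second command
assumes a 1.1-style <cf>.json leveled manifest with a top-level "generations"
list, so check what your manifest actually contains before relying on it):

    # number of SSTables on disk for one column family
    ls /var/lib/cassandra/data/<keyspace>/<cf>/ | grep -c 'Data.db$'

    # SSTables per level, read from the (assumed) manifest layout
    python -c 'import json,sys; [sys.stdout.write("L%d: %d files\n" % (g["generation"], len(g["members"]))) for g in json.load(open(sys.argv[1]))["generations"]]' /var/lib/cassandra/data/<keyspace>/<cf>/<cf>.json

If almost everything sits in L0, that matches the "stuck at lower levels"
explanation above.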

> >2012/9/17 Michael Kjellman <mkjellman@barracuda.com>:
> >Hi All,
> >
> >I have an issue where each one of my nodes (currently all running 1.1.5)
> >is reporting around 30,000 pending compactions. I understand that a
> >pending compaction doesn't necessarily mean it is a scheduled task, but
> >I'm confused why this behavior is occurring. It is the same on all nodes:
> >the count occasionally drops by 5k pending compaction tasks and then
> >returns to 25,000-35,000.
> >
> >I have tried a repair operation and a scrub operation on two of the
> >nodes, and while compactions initially happen, the number of pending
> >compactions does not decrease.
> >
> >Any ideas? Thanks for your time.
> >
> >Best,
> >michael
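To tell whether a backlog like the one described above is actually being
worked through, the pending-task counter alone says little; it helps to watch
the active compactions too (a sketch, plain read-only nodetool calls):

    nodetool compactionstats                # pending tasks + compactions currently running
    watch -n 30 nodetool compactionstats    # re-run periodically to see whether bytes/pending move

If compactions are running but crawling, compaction throttling is one thing to
check: nodetool setcompactionthroughput 0 removes the throttle (0 = unlimited)
until it is set back or the node restarts.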
--
Best regards,
Vitalii Tymchyshyn