Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
MIME-Version: 1.0
In-Reply-To: <CACuMZ5TGsMug_ksmt5NhDwOCS-T8tCrkNNXBtED4mXP8aZYumg@mail.gmail.com>
References: <CACuMZ5R-KeV5d9ErMNcGncBsOdyODrKjyW3FRZb5uTDtf18qeQ@mail.gmail.com>
 <pony-cb367aa5aed1991b197e48bcdd89864c2a894a70-8d4c475139025669de2356522f68b5cfd9f99f0a@user.cassandra.apache.org>
 <CACuMZ5Q+_jH5Z-bJaMRdaC8=x01Zj3BcKgj7+ApdBV7NjZN4cw@mail.gmail.com>
 <CA+Emch==LcFZmM7UD54oUDp0uq9B5rpEBcqdz-w3yVPxXzzV-Q@mail.gmail.com> <CACuMZ5TGsMug_ksmt5NhDwOCS-T8tCrkNNXBtED4mXP8aZYumg@mail.gmail.com>
From: Jeff Jirsa <jjirsa@gmail.com>
Date: Thu, 11 May 2017 13:11:48 -0700
Message-ID: <CA+Emchmy=jCiWjc=AEbCBMpvb1ASdEofiQ0mZhSmrnvJJhCT4g@mail.gmail.com>
Subject: Re: Nodetool cleanup doesn't work
To: Jai Bheemsen Rao Dhanwada <jaibheemsen@gmail.com>
Cc: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary="001a11404c781b3f5b054f4534ca"
archived-at: Thu, 11 May 2017 20:12:19 -0000

--001a11404c781b3f5b054f4534ca
Content-Type: text/plain; charset="UTF-8"

No, it's not expected, but it's pretty obvious from reading the code
what'll happen. Opened https://issues.apache.org/jira/browse/CASSANDRA-13526


On Thu, May 11, 2017 at 12:53 PM, Jai Bheemsen Rao Dhanwada <
jaibheemsen@gmail.com> wrote:

> Yes I have many keyspaces which are not spread across all the data
> centers(expected by design).
> In this case, is this the expected behavior cleanup will not work for all
> the keyspaces(nodetool cleanup)? is it going to be fixed in the latest
> versions?
>
> P.S: Thanks for the tip, I can workaround this by "nodetool cleanup
> keyspacename"
>
> On Thu, May 11, 2017 at 12:11 PM, Jeff Jirsa <jjirsa@gmail.com> wrote:
>
>> If you didn't explicitly remove a keyspace from one of your datacenters,
>> the next most likely cause is that you have one keyspace that's NOT
>> replicated to one of the datacenters. You can work around this by running
>> 'nodetool cleanup <ks>' on all of your other keyspaces individually,
>> skipping the one that isn't replicated to that datacenter.
>>
>>
>>
>> On Thu, May 11, 2017 at 11:19 AM, Jai Bheemsen Rao Dhanwada <
>> jaibheemsen@gmail.com> wrote:
>>
>>> Thanks Jeff,
>>>
>>> I have a C* cluster spread across multiple datacenter.
>>> reason for cleanup : I added multiple nodes to cluster and need to run
>>> cleanup on old nodes so that the redundant data is cleaned-up.
>>>
>>> On Thu, May 11, 2017 at 11:08 AM, Jeff Jirsa <jjirsa@apache.org> wrote:
>>>
>>>>
>>>>
>>>> On 2017-05-10 22:44 (-0700), Jai Bheemsen Rao Dhanwada <
>>>> jaibheemsen@gmail.com> wrote:
>>>> > Hello,
>>>> >
>>>> > I am running into an issue where *nodetool cleanup *fails to cleanup
>>>> data.
>>>> > We are running 2.1.16 version of Cassandra.
>>>> >
>>>> >
>>>> > [user@host ~]$ nodetool cleanup
>>>> > Aborted cleaning up atleast one column family in keyspace user, check
>>>> > server logs for more information.
>>>> > Aborted cleaning up atleast one column family in keyspace org, check
>>>> server
>>>> > logs for more information.
>>>> > error: nodetool failed, check server logs
>>>> > -- StackTrace --
>>>> > java.lang.RuntimeException: nodetool failed, check server logs
>>>> >         at
>>>> > org.apache.cassandra.tools.NodeTool$NodeToolCmd.run(NodeTool
>>>> .java:294)
>>>> >         at org.apache.cassandra.tools.Nod
>>>> eTool.main(NodeTool.java:206)
>>>> >
>>>> > *Logs:*
>>>> >
>>>> > INFO  [RMI TCP Connection(17)-x.x.x.x] 2017-05-05 04:04:07,987
>>>> > CompactionManager.java:415 - Cleanup cannot run before a node has
>>>> joined
>>>> > the ring
>>>> > INFO  [RMI TCP Connection(17)-x.x.x.x] 2017-05-05 04:04:08,010
>>>> > CompactionManager.java:415 - Cleanup cannot run before a node has
>>>> joined
>>>> > the ring
>>>> >
>>>> > All the nodes in the cluster are up and running. We tried doing a
>>>> rolling
>>>> > restart of all nodes and no luck.
>>>> >
>>>> > After looking at the Cassandra JIRA :
>>>> > https://issues.apache.org/jira/browse/CASSANDRA-10991 looks like the
>>>> issue
>>>> > is fixed with 2.2.6 and 3.0 version.
>>>> > While we have plans to upgrade to the latest versions(which might take
>>>> > longer time), does any know if there is any work around to mitigate
>>>> the
>>>> > issue?
>>>> >
>>>>
>>>> Are you running multiple datacenters, and you just removed a specific
>>>> datacenter from a keyspace (and that's why you want to run cleanup)? If
>>>> that's the case, I fear the fix for 10991 isn't really going to fix it in
>>>> the way you hope (we may need a follow-up jira). What you'll almost
>>>> certainly need to do is remove the data on disk manually, which is quite
>>>> unfortunate as it'll require you to stop+delete-data-for-that-keyspace+start
>>>> each node in the datacenter for which you removed replication.
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: user-unsubscribe@cassandra.apache.org
>>>> For additional commands, e-mail: user-help@cassandra.apache.org
>>>>
>>>>
>>>
>>
>

--001a11404c781b3f5b054f4534ca
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">No, it&#39;s not expected, but it&#39;s pretty obvious fro=
m reading the code what&#39;ll happen. Opened=C2=A0<a href=3D"https://issue=
s.apache.org/jira/browse/CASSANDRA-13526">https://issues.apache.org/jira/br=
owse/CASSANDRA-13526</a><div><br></div><div><br><div><br></div><div><br></d=
iv></div></div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On=
 Thu, May 11, 2017 at 12:53 PM, Jai Bheemsen Rao Dhanwada <span dir=3D"ltr"=
>&lt;<a href=3D"mailto:jaibheemsen@gmail.com" target=3D"_blank">jaibheemsen=
@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=
=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=
=3D"ltr">Yes I have many keyspaces which are not spread across all the data=
 centers(expected by design).<div>In this case, is this the expected behavi=
or cleanup will not work for all the keyspaces(nodetool cleanup)? is it goi=
ng to be fixed in the latest versions?</div><div><br></div><div>P.S: Thanks=
 for the tip, I can workaround this by &quot;nodetool cleanup keyspacename&=
quot;</div></div><div class=3D"HOEnZb"><div class=3D"h5"><div class=3D"gmai=
l_extra"><br><div class=3D"gmail_quote">On Thu, May 11, 2017 at 12:11 PM, J=
eff Jirsa <span dir=3D"ltr">&lt;<a href=3D"mailto:jjirsa@gmail.com" target=
=3D"_blank">jjirsa@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"=
gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-=
left:1ex"><div dir=3D"ltr">If you didn&#39;t explicitly remove a keyspace f=
rom one of your datacenters, the next most likely cause is that you have on=
e keyspace that&#39;s NOT replicated to one of the datacenters. You can wor=
k around this by running &#39;nodetool cleanup &lt;ks&gt;&#39; on all of yo=
ur other keyspaces individually, skipping the one that isn&#39;t replicated=
 to that datacenter.<div><br></div><div><br></div></div><div class=3D"m_-16=
45396950528340665HOEnZb"><div class=3D"m_-1645396950528340665h5"><div class=
=3D"gmail_extra"><br><div class=3D"gmail_quote">On Thu, May 11, 2017 at 11:=
19 AM, Jai Bheemsen Rao Dhanwada <span dir=3D"ltr">&lt;<a href=3D"mailto:ja=
ibheemsen@gmail.com" target=3D"_blank">jaibheemsen@gmail.com</a>&gt;</span>=
 wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;bor=
der-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">Thanks Jeff,<div=
><br></div><div>I have a C* cluster spread across multiple datacenter.</div=
><div>reason for cleanup : I added multiple nodes to cluster and need to ru=
n cleanup on old nodes so that the redundant data is cleaned-up.</div></div=
><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Thu, May 11, =
2017 at 11:08 AM, Jeff Jirsa <span dir=3D"ltr">&lt;<a href=3D"mailto:jjirsa=
@apache.org" target=3D"_blank">jjirsa@apache.org</a>&gt;</span> wrote:<br><=
blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px=
 #ccc solid;padding-left:1ex"><span><br>
<br>
On 2017-05-10 22:44 (-0700), Jai Bheemsen Rao Dhanwada &lt;<a href=3D"mailt=
o:jaibheemsen@gmail.com" target=3D"_blank">jaibheemsen@gmail.com</a>&gt; wr=
ote:<br>
&gt; Hello,<br>
&gt;<br>
</span>&gt; I am running into an issue where *nodetool cleanup *fails to cl=
eanup data.<br>
<span>&gt; We are running 2.1.16 version of Cassandra.<br>
&gt;<br>
&gt;<br>
&gt; [user@host ~]$ nodetool cleanup<br>
&gt; Aborted cleaning up atleast one column family in keyspace user, check<=
br>
&gt; server logs for more information.<br>
&gt; Aborted cleaning up atleast one column family in keyspace org, check s=
erver<br>
&gt; logs for more information.<br>
&gt; error: nodetool failed, check server logs<br>
&gt; -- StackTrace --<br>
&gt; java.lang.RuntimeException: nodetool failed, check server logs<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0at<br>
&gt; org.apache.cassandra.tools.Nod<wbr>eTool$NodeToolCmd.run(NodeTool<wbr>=
.java:294)<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0at org.apache.cassandra.tools.Nod<wbr=
>eTool.main(NodeTool.java:206)<br>
&gt;<br>
</span>&gt; *Logs:*<br>
<span>&gt;<br>
&gt; INFO=C2=A0 [RMI TCP Connection(17)-x.x.x.x] 2017-05-05 04:04:07,987<br=
>
&gt; CompactionManager.java:415 - Cleanup cannot run before a node has join=
ed<br>
&gt; the ring<br>
&gt; INFO=C2=A0 [RMI TCP Connection(17)-x.x.x.x] 2017-05-05 04:04:08,010<br=
>
&gt; CompactionManager.java:415 - Cleanup cannot run before a node has join=
ed<br>
&gt; the ring<br>
&gt;<br>
&gt; All the nodes in the cluster are up and running. We tried doing a roll=
ing<br>
&gt; restart of all nodes and no luck.<br>
&gt;<br>
&gt; After looking at the Cassandra JIRA :<br>
&gt; <a href=3D"https://issues.apache.org/jira/browse/CASSANDRA-10991" rel=
=3D"noreferrer" target=3D"_blank">https://issues.apache.org/jira<wbr>/brows=
e/CASSANDRA-10991</a> looks like the issue<br>
&gt; is fixed with 2.2.6 and 3.0 version.<br>
&gt; While we have plans to upgrade to the latest versions(which might take=
<br>
&gt; longer time), does any know if there is any work around to mitigate th=
e<br>
&gt; issue?<br>
&gt;<br>
<br>
</span>Are you running multiple datacenters, and you just removed a specifi=
c datacenter from a keyspace (and that&#39;s why you want to run cleanup)? =
If that&#39;s the case, I fear the fix for 10991 isn&#39;t really going to =
fix it in the way you hope (we may need a follow-up jira). What you&#39;ll =
almost certainly need to do is remove the data on disk manually, which is q=
uite unfortunate as it&#39;ll require you to stop+delete-data-for-that-keys=
<wbr>pace+start each node in the datacenter for which you removed replicati=
on.<br>
<br>
------------------------------<wbr>------------------------------<wbr>-----=
----<br>
To unsubscribe, e-mail: <a href=3D"mailto:user-unsubscribe@cassandra.apache=
.org" target=3D"_blank">user-unsubscribe@cassandra.apa<wbr>che.org</a><br>
For additional commands, e-mail: <a href=3D"mailto:user-help@cassandra.apac=
he.org" target=3D"_blank">user-help@cassandra.apache.org</a><br>
<br>
</blockquote></div><br></div>
</blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>

--001a11404c781b3f5b054f4534ca--