From: Anthony Grasso <anthony.grasso@gmail.com>
Date: Mon, 3 Feb 2020 13:35:40 +1100
Subject: Re: [EXTERNAL] How to reduce vnodes without downtime
To: user@cassandra.apache.org
Reply-To: user@cassandra.apache.org
Hi Sergio,

There is a misunderstanding here. My post makes no recommendation for the value of num_tokens. Rather, it focuses on how to use the allocate_tokens_for_keyspace setting when creating a new cluster.

Whilst a value of 4 is used for num_tokens in the post, it was chosen for demonstration purposes. Specifically, it makes:

- the uneven token distribution in a small cluster very obvious,
- identifying the endpoints displayed in nodetool ring easy, and
- the initial_token setup less verbose and easier to follow.

I will add an editorial note to the post with the above information so there is no confusion about why 4 tokens were used.

I would only consider moving a cluster to 4 tokens if it is larger than 100 nodes. If you read through the paper that Erick mentioned, written by Joe Lynch & Josh Snyder, they show that num_tokens impacts the availability of large-scale clusters.

If you are after more details about the trade-offs between different num_tokens values, please see the discussion on the dev mailing list: "[Discuss] num_tokens default in Cassandra 4.0".

Regards,
Anthony

On Sat, 1 Feb 2020 at 10:07, Sergio <lapostadisergio@gmail.com> wrote:

> https://thelastpickle.com/blog/2019/02/21/set-up-a-cluster-with-even-token-distribution.html
> This is the article with the 4-token recommendation.
> @Erick Ramirez, which is the dev thread for the default 32 tokens recommendation?
>
> Thanks,
> Sergio
>
> On Fri, 31 Jan 2020 at 14:49, Erick Ramirez <flightctlr@gmail.com> wrote:
>
>> There's an active discussion going on right now in a separate dev thread.
>> The current "default recommendation" is 32 tokens. But there's a push for 4
>> in combination with allocate_tokens_for_keyspace from Jon Haddad & co
>> (based on a paper from Joe Lynch & Josh Snyder).
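For reference, the two settings under discussion live in cassandra.yaml. A minimal sketch (the keyspace name is a placeholder, and 4 is the demonstration value from Anthony's post, not a sizing recommendation):

```yaml
# Illustrative cassandra.yaml fragment, not a recommendation.
# allocate_tokens_for_keyspace biases token allocation toward even ownership
# for the replication settings of the named keyspace (a 3.x option).
num_tokens: 4
allocate_tokens_for_keyspace: my_keyspace   # placeholder keyspace name
```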
>> If you're satisfied with the results from your own testing, go with 4
>> tokens. And that's the key -- you must test, test, TEST! Cheers!
>>
>> On Sat, Feb 1, 2020 at 5:17 AM Arvinder Dhillon <dhillonarvi@gmail.com> wrote:
>>
>>> What is the recommended vnodes value now? I read 8 for later Cassandra 3.x.
>>> Is the new recommendation 4 now, even in version 3.x (asking for 3.11)?
>>> Thanks
>>>
>>> On Fri, Jan 31, 2020 at 9:49 AM Durity, Sean R <SEAN_R_DURITY@homedepot.com> wrote:
>>>
>>>> These are good clarifications and expansions.
>>>>
>>>> Sean Durity
>>>>
>>>> From: Anthony Grasso <anthony.grasso@gmail.com>
>>>> Sent: Thursday, January 30, 2020 7:25 PM
>>>> To: user <user@cassandra.apache.org>
>>>> Subject: Re: [EXTERNAL] How to reduce vnodes without downtime
>>>>
>>>> Hi Maxim,
>>>>
>>>> Basically, what Sean suggested is the way to do this without downtime.
>>>>
>>>> To clarify, the three steps following the "Decommission each node in
>>>> the DC you are working on" step should be applied to only the
>>>> decommissioned nodes. So where it says "all nodes" or "every node" it
>>>> applies to only the decommissioned nodes.
>>>>
>>>> In addition, for the step that says "Wipe data on all the nodes", I would
>>>> delete all files in the following directories on the decommissioned nodes.
>>>>
>>>> - data (usually located in /var/lib/cassandra/data)
>>>> - commitlog (usually located in /var/lib/cassandra/commitlog)
>>>> - hints (usually located in /var/lib/cassandra/hints)
>>>> - saved_caches (usually located in /var/lib/cassandra/saved_caches)
>>>>
>>>> Cheers,
>>>>
>>>> Anthony
>>>>
>>>> On Fri, 31 Jan 2020 at 03:05, Durity, Sean R <SEAN_R_DURITY@homedepot.com> wrote:
>>>>
>>>> Your procedure won't work very well. On the first node, if you switched
>>>> to 4, you would end up with only a tiny fraction of the data (because the
>>>> other nodes would still be at 256).
>>>> I updated a large cluster (over 150
>>>> nodes -- 2 DCs) to a smaller number of vnodes. The basic outline was this:
>>>>
>>>> - Stop all repairs
>>>> - Make sure the app is running against one DC only
>>>> - Change the replication settings on keyspaces to use only 1 DC
>>>>   (basically cutting off the other DC)
>>>> - Decommission each node in the DC you are working on. Because the
>>>>   replication settings are changed, no streaming occurs. But it releases
>>>>   the token assignments
>>>> - Wipe data on all the nodes
>>>> - Update configuration on every node to your new settings, including
>>>>   auto_bootstrap = false
>>>> - Start all nodes. They will choose tokens, but not stream any data
>>>> - Update the replication factor for all keyspaces to include the new DC
>>>> - I disabled binary on those nodes to prevent app connections
>>>> - Run nodetool rebuild with -dc (other DC) on as many nodes as your
>>>>   system can safely handle until they are all rebuilt
>>>> - Re-enable binary (and app connections to the rebuilt DC)
>>>> - Turn on repairs
>>>> - Rest for a bit, then reverse the process for the remaining DCs
>>>>
>>>> Sean Durity -- Staff Systems Engineer, Cassandra
>>>>
>>>> From: Maxim Parkachov <lazy.gopher@gmail.com>
>>>> Sent: Thursday, January 30, 2020 10:05 AM
>>>> To: user@cassandra.apache.org
>>>> Subject: [EXTERNAL] How to reduce vnodes without downtime
>>>>
>>>> Hi everyone,
>>>>
>>>> With the discussion about reducing the default vnodes in version 4.0, I
>>>> would like to ask what would be the optimal procedure to reduce vnodes in
>>>> an existing 3.11.x cluster which was set up with the default value 256.
>>>> The cluster has 2 DCs with 5 nodes each and RF=3. There is one more
>>>> restriction: I cannot add more servers, nor create an additional DC;
>>>> everything is physical. This should be done without downtime.
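Sean's outline, combined with Anthony's note earlier in the thread about which directories to wipe, can be sketched as shell steps. This is a dry-run sketch only: the `run` wrapper echoes each command instead of executing it, and the DC name and paths are assumptions, not commands taken from the thread.

```shell
# Dry-run sketch: 'run' prints each step, so this can be read (and executed)
# without a live cluster. Replace the echo with real execution on real nodes.
run() { echo "+ $*"; }

# On each node in the DC being reworked (replication already cut over):
run nodetool decommission

# Wipe only the decommissioned nodes (default path layout assumed; the
# quoted glob is illustrative, not expanded here):
for d in data commitlog hints saved_caches; do
  run rm -rf "/var/lib/cassandra/$d/*"
done

# After editing cassandra.yaml (new num_tokens, auto_bootstrap: false) and
# restarting, stream data back from the untouched DC ("dc_other" is a
# placeholder DC name):
run nodetool rebuild -- dc_other
run nodetool enablebinary   # re-admit client connections
```

Running the sketch as-is just prints the command sequence; the point is the ordering: decommission, wipe, reconfigure and restart, then rebuild from the surviving DC before re-enabling clients.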
>>>> My idea for such a procedure would be:
>>>>
>>>> for each node:
>>>> - decommission node
>>>> - set auto_bootstrap to true and vnodes to 4
>>>> - start and wait till the node joins the cluster
>>>> - run cleanup on the rest of the nodes in the cluster
>>>> - run repair on the whole cluster (not sure if needed after cleanup)
>>>> - set auto_bootstrap to false
>>>> repeat for each node
>>>>
>>>> rolling restart of cluster
>>>> cluster repair
>>>>
>>>> Does this sound right? My concern is that after decommission, the node
>>>> will start on the same IP, which could create some confusion.
>>>>
>>>> Regards,
>>>> Maxim.
>>>>
>>>> ------------------------------
>>>>
>>>> The information in this Internet Email is confidential and may be
>>>> legally privileged. It is intended solely for the addressee. Access to
>>>> this Email by anyone else is unauthorized. If you are not the intended
>>>> recipient, any disclosure, copying, distribution or any action taken or
>>>> omitted to be taken in reliance on it, is prohibited and may be unlawful.
>>>> When addressed to our clients any opinions or advice contained in this
>>>> Email are subject to the terms and conditions expressed in any applicable
>>>> governing The Home Depot terms of business or client engagement letter.
>>>> The Home Depot disclaims all responsibility and liability for the
>>>> accuracy and content of this attachment and for any damages or losses
>>>> arising from any inaccuracies, errors, viruses, e.g., worms, trojan
>>>> horses, etc., or other items of a destructive nature, which may be
>>>> contained in this attachment and shall not be liable for direct,
>>>> indirect, consequential or special damages in connection with this e-mail
>>>> message or its attachment.
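Maxim's proposed per-node loop can be sketched the same dry-run way, purely to make the proposal concrete. Note Sean's caveat above: converting one node at a time would leave it with only a tiny token share while the other nodes remain at 256. Node names here are placeholders, and `say` only prints each step.

```shell
# Dry-run sketch of the per-node proposal from the thread; nothing executes.
say() { echo "+ $*"; }

for node in node1 node2 node3; do   # placeholder node names
  say "on $node: nodetool decommission"
  say "on $node: set num_tokens: 4 and auto_bootstrap: true, then restart"
  say "on remaining nodes: nodetool cleanup"
  say "cluster-wide: nodetool repair (possibly redundant after cleanup)"
  say "on $node: set auto_bootstrap back to false"
done
say "rolling restart of cluster"
say "cluster-wide repair"
```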