From: Ahmed Eljami
Date: Thu, 22 Aug 2019 11:04:00 +0200
Subject: Re: Cassandra copy command
To: user@cassandra.apache.org

Hello,

As Jean said, it would be preferable to use http://cassandra-reaper.io

That way you don't have to manually manage the consistency of your Cassandra ring or the list of nodes to repair.


On Wed, Aug 21, 2019 at 15:57, Rahul Reddy <rahulreddy1234@gmail.com> wrote:
Thanks Jean,

I have dc1 and dc2 existing. I added dc3 from dc1 and dc4 from dc2. If I want to run repair on one node in dc3 from dc1 only, is that possible?

On Wed, Aug 21, 2019, 8:11 AM Jean Carlo <jean.jeancarl48@gmail.com> wrote:
Hello Rahul,

To ensure consistency among the DCs, it is enough to run a repair command.

You can do it using http://cassandra-reaper.io/
or by running the command nodetool repair with the respective options on every node.

You do not need to count the rows in every DC to ensure Cassandra is in sync among the DCs after you have run the repair. But if you still want to do it, use Spark for it.
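
A minimal sketch of the "run nodetool repair on every node" option, assuming the repairs are launched from a single admin machine. The host addresses and the keyspace name are hypothetical, and the `-pr` (primary range) flag is an assumption about what "the respective options" could mean when every node is repaired in turn; the thread does not name specific options. The helper only builds the command lines and does not execute anything.

```python
from typing import List


def repair_commands(hosts: List[str], keyspace: str) -> List[List[str]]:
    """Build one `nodetool repair` invocation per node.

    `-h` selects the node to talk to and `-pr` limits the repair to that
    node's primary token ranges, which only makes sense when the command
    is run against every node in the cluster.
    """
    return [["nodetool", "-h", host, "repair", "-pr", keyspace] for host in hosts]


if __name__ == "__main__":
    # Hypothetical node addresses and keyspace name, for illustration only.
    for command in repair_commands(["10.0.0.1", "10.0.0.2"], "my_keyspace"):
        print(" ".join(command))
```

Once the printed commands have been reviewed, each list could be passed to `subprocess.run` one node at a time; Reaper, as suggested above, handles that scheduling for you.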

Jean Carlo

"The best way to predict the future is to invent it" Alan Kay



On Wed, Aug 21, 2019 at 1:51 PM Rahul Reddy <rahulreddy1234@gmail.com> wrote:
Yep, I did run rebuild on each new node.

On Wed, Aug 21, 2019, 7:25 AM Stefan Miklosovic <stefan.miklosovic@instaclustr.com> wrote:
Hi Rahul,

How did you add dc3 to the cluster? The rule of thumb here is to run
a rebuild on each new node, for example as described here:
https://docs.datastax.com/en/archived/cassandra/3.0/cassandra/operations/opsAddDCToCluster.html
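
As an illustration of the rebuild step, the sketch below builds one `nodetool rebuild -- <source DC>` command per new node, which is the form shown on the linked DataStax page. The node addresses are hypothetical; the DC names follow the thread (dc3 built from dc1, dc4 from dc2). Nothing is executed; the function only returns the command lines.

```python
from typing import Dict, List


def rebuild_commands(source_dc_by_node: Dict[str, str]) -> List[List[str]]:
    """Build one `nodetool rebuild` invocation per new node.

    Keys are the addresses of the new nodes; values are the names of the
    existing data centers each node should stream its data from.
    """
    return [
        ["nodetool", "-h", node, "rebuild", "--", source_dc]
        for node, source_dc in source_dc_by_node.items()
    ]


if __name__ == "__main__":
    # Hypothetical addresses: one new node in dc3 and one in dc4.
    plan = {"10.0.3.1": "dc1", "10.0.4.1": "dc2"}
    for command in rebuild_commands(plan):
        print(" ".join(command))
```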

On Wed, 21 Aug 2019 at 12:57, Rahul Reddy <rahulreddy1234@gmail.com> wrote:
>
> Hi Stefan,
>
> I'm adding a new DC3 to an existing cluster and see discrepancies of a couple of million in nodetool cfstats in the new DC.
>
> My table size is 50 GB.
> I'm trying to copy the entire table:
>
> Copy table to 'full_tablr.csv' with delimiter ',';
>
> If I run the above command from dc3, does it get the data only from dc3?
>
>
> On Wed, Aug 21, 2019, 6:46 AM Stefan Miklosovic <stefan.miklosovic@instaclustr.com> wrote:
>>
>> Hi Rahul,
>>
>> What is your motivation behind this? Why do you want to make sure the
>> count is the same? What is the purpose of that? All you should care about
>> is that Cassandra returns the right results. It was designed from
>> the ground up to do that for you, so you should not be bothered too
>> much by such discrepancies; in general they will always be there.
>> The important fact is that once data is queried, you can rest assured it
>> is returned (and consequently repaired if the data does not match) as it
>> should be.
>>
>> Which copy command are you talking about precisely, and why can't you
>> just use repair?
>>
>> On Wed, 21 Aug 2019 at 12:14, Rahul Reddy <rahulreddy1234@gmail.com> wrote:
>> >
>> > Hello,
>> >
>> > I have 3 datacenters and want to make sure the record count is the same in all DCs. If I run the copy command on node1 in dc1, does it get the data from only dc1? In nodetool cfstats I'm seeing discrepancies in the partition count; is that because we didn't run cleanup after adding a few nodes and removing them? To rule out any discrepancies I want to run the copy command from all 3 DCs and compare. Please let me know whether the copy command extracts data only from the DC I ran it from.
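
If the exports are produced anyway, the comparison step described above could be sketched as follows. This assumes each export is a comma-delimited CSV without a header row and that the first `key_columns` columns form the primary key; the file names in the usage example are hypothetical. Note that nothing in this thread confirms that a COPY run from a node in one DC reads only from that DC, so differences between the files do not by themselves prove a difference between DCs. The sketch also holds every key in memory, which may not be practical for a 50 GB table.

```python
import csv
from typing import Dict, Set, Tuple

Key = Tuple[str, ...]


def load_keys(path: str, key_columns: int = 1) -> Set[Key]:
    """Read a CSV export and return the set of primary-key tuples."""
    with open(path, newline="") as handle:
        return {tuple(row[:key_columns]) for row in csv.reader(handle) if row}


def compare_exports(paths: Dict[str, str], key_columns: int = 1) -> Dict[str, Set[Key]]:
    """For each export, return the keys seen in another export but missing here."""
    keys = {name: load_keys(path, key_columns) for name, path in paths.items()}
    all_keys: Set[Key] = set().union(*keys.values())
    return {name: all_keys - found for name, found in keys.items()}


if __name__ == "__main__":
    # Hypothetical file names, one export per data center.
    exports = {"dc1": "dc1.csv", "dc2": "dc2.csv", "dc3": "dc3.csv"}
    for name, missing in compare_exports(exports).items():
        print(f"{name}: {len(missing)} keys missing")
```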
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: user-unsubscribe@cassandra.apache.org
>> For additional commands, e-mail: user-help@cassandra.apache.org
>>




--
Regards,

Ahmed ELJAMI