From user-return-64373-archive-asf-public=cust-asf.ponee.io@cassandra.apache.org Wed Aug 21 10:57:39 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 2EB20180607 for ; Wed, 21 Aug 2019 12:57:39 +0200 (CEST) Received: (qmail 27474 invoked by uid 500); 21 Aug 2019 10:57:36 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 27459 invoked by uid 99); 21 Aug 2019 10:57:36 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 Aug 2019 10:57:36 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 3A7431A4097 for ; Wed, 21 Aug 2019 10:57:36 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.051 X-Spam-Level: ** X-Spam-Status: No, score=2.051 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-ec2-va.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 8Enmw5gQnebS for ; Wed, 21 Aug 2019 10:57:34 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=209.85.161.65; helo=mail-yw1-f65.google.com; envelope-from=rahulreddy1234@gmail.com; receiver= Received: from mail-yw1-f65.google.com (mail-yw1-f65.google.com [209.85.161.65]) by mx1-ec2-va.apache.org (ASF Mail Server at mx1-ec2-va.apache.org) with ESMTPS id 046F6BDEB0 for ; Wed, 21 Aug 2019 10:57:33 +0000 (UTC) Received: by mail-yw1-f65.google.com with SMTP id u141so763471ywe.4 for ; Wed, 21 Aug 2019 03:57:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to; bh=K1SFxNgxIYAHrA/iLHhcTi68RJk8hHihK9RP4TuxDrs=; b=c09To2UT5701AagpLCYnKkOkc4AFeiEYO+dByAZ71Rx3OvKcc4xQexbhX55WnzZ91h Ey1u83binbdP4HZEIhnOPZtrYD3xJ87Ct8lff9mW7i5eU7QgGOfk4iPH1lpuXrxb4DdM bc8OiS3GQ08UN7uoBG/U+IzXOx7YV/pkJ+WxvMU8wAZXVQBnnpmJ8rxa2YKGaEptg1+j cP6fxAzjF71U43IcPogAtblkcYQ1pgDk+T+2mqPBItprHQSRtl7BHM9ZxjzzbPrdExzy +etAnstTJvtDcmklwrogRfX/zA4dGgVkNLtNrmE1Xgmg2TumeJAzdW51IA7x0nqDtX9I abHQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=K1SFxNgxIYAHrA/iLHhcTi68RJk8hHihK9RP4TuxDrs=; b=F2U252bCmgVWAi5kcCZmvoFdjOsj+44JXRl05whAtVUrp+mRBNEdfFmJq1VT5w+BRS 2w4yWArWSwwA3wD2egdFh0OzJAitzxROTYvaXs77E8REROo52E/Lj4wl8mD4anlW4auY YW9fD7m/Kx91loCh96D8MCFM9BEzXIYL6fANRNqkUGdmdK0bf+07JPIIyhB/0cIz0mLX lKDhMCr3mC31/Yi5tWr9G+I45w+RY7ir21oqDiBYW0OOKzWajYePKu7MMgySgXRxJQzG 0GHAwXAkFvUzF8vbE9X1iuBwiXTzlYkSCRCZwweiSkMnUPMGj0WFeHYMAc8k3ZNeXCa1 oddw== X-Gm-Message-State: APjAAAVbjrXeWjcfYcUWFJiI3YR8xbO7WvOHZkQrxCA5OENzUlR4EKsO ZsDpUezlm1Mp786U1yuNDTQGNHAHiSvsC7m3f2qv6w== X-Google-Smtp-Source: APXvYqzzSlfWgIsX11++plUHoV0BvF2EkulmdapDpKuBkPAKw8Dm9M/YeXMchusUrWWJ2eMX8/ABDy6nnoswwLklUYM= X-Received: by 2002:a81:6145:: with SMTP id v66mr22853261ywb.136.1566385053247; Wed, 21 Aug 2019 03:57:33 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Rahul Reddy Date: Wed, 21 Aug 2019 06:57:21 -0400 Message-ID: Subject: Re: Cassandra copy command To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary="000000000000ae2cb005909e7002" --000000000000ae2cb005909e7002 Content-Type: text/plain; charset="UTF-8" Hi sefan, I'm adding new DC3 to exiting cluster and see discripencies couple of millions in Nodetool cfstats in new DC. My table size is 50gb I'm trying to run copy entire table. Copy table to 'full_tablr.csv' with delimiter ','; If I run above command from dc3. Does it get the data only from dc3? On Wed, Aug 21, 2019, 6:46 AM Stefan Miklosovic < stefan.miklosovic@instaclustr.com> wrote: > Hi Rahul, > > what is your motivation behind this? Why do you want to make sure the > count is same? What is the purpose of that? All you should care about > is that Cassandra will return you right results. It was designed from > the very bottom to do that for you, you should not be bothered too > much about such discrepancies, they will be always there in general. > But the important fact is that once queried, you can rest assured it > is returned (and consequentially repaired if data not match) as they > should. > > What copy command you are talking about precisely, why you cant use just > repair? > > On Wed, 21 Aug 2019 at 12:14, Rahul Reddy > wrote: > > > > Hello, > > > > I have 3 datacenters . Want to make sure record count is same in all > dc's . If I run copy command node1 in dc1 does it get the data from only > dc1? Nodetool cfstats I'm seeing discrepancies in partitions count is it > because we didn't run cleanup after adding few nodes and remove them?. To > rule out any discripencies I want to run copy command from 3 DC's and > compare. Please let me know if copy command extracts data from the DC only > I ran it from? > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscribe@cassandra.apache.org > For additional commands, e-mail: user-help@cassandra.apache.org > > --000000000000ae2cb005909e7002 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi sefan,

I&= #39;m adding new DC3 to exiting cluster and see discripencies couple of mil= lions in Nodetool cfstats in new DC.=C2=A0

My table size is 50gb
I'm try= ing to run copy entire table.

Copy table to 'full_tablr.csv' with delimiter ',';

If I run above command fro= m dc3. Does it get the data only from dc3?



On Wed, Aug 21, 2019, 6:46 AM Stefan Miklosov= ic <stefan.miklosov= ic@instaclustr.com> wrote:
H= i Rahul,

what is your motivation behind this? Why do you want to make sure the
count is same? What is the purpose of that? All you should care about
is that Cassandra will return you right results. It was designed from
the very bottom to do that for you, you should not be bothered too
much about such discrepancies, they will be always there in general.
But the important fact is that once queried, you can rest assured it
is returned (and consequentially repaired if data not match) as they
should.

What copy command you are talking about precisely, why you cant use just re= pair?

On Wed, 21 Aug 2019 at 12:14, Rahul Reddy <rahulreddy1234@gmail.co= m> wrote:
>
> Hello,
>
> I have 3 datacenters . Want to make sure record count is same in all d= c's . If I run copy command node1 in dc1 does it get the data from only= dc1? Nodetool cfstats I'm seeing discrepancies in partitions count is = it because we didn't run cleanup after adding few nodes and remove them= ?. To rule out any discripencies I want to run copy command from 3 DC's= and compare. Please let me know if copy command extracts data from the DC = only I ran it from?

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@cassandra.apach= e.org
For additional commands, e-mail: user-help@cassandra.apache.org=

--000000000000ae2cb005909e7002--