Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 32F04200BF3 for ; Thu, 5 Jan 2017 23:57:11 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 3165F160B33; Thu, 5 Jan 2017 22:57:11 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id DA3E5160B26 for ; Thu, 5 Jan 2017 23:57:09 +0100 (CET) Received: (qmail 64398 invoked by uid 500); 5 Jan 2017 22:57:08 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 64388 invoked by uid 99); 5 Jan 2017 22:57:08 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 Jan 2017 22:57:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 950531A7B5F for ; Thu, 5 Jan 2017 22:57:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.98 X-Spam-Level: ** X-Spam-Status: No, score=2.98 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, KAM_LINEPADDING=1.2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=instaclustr-com.20150623.gappssmtp.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 7D9-zCfhoh6Q for ; Thu, 5 Jan 2017 22:57:03 +0000 (UTC) Received: from mail-qt0-f181.google.com (mail-qt0-f181.google.com [209.85.216.181]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id A5CED5F24E for ; Thu, 5 Jan 2017 22:57:02 +0000 (UTC) Received: by mail-qt0-f181.google.com with SMTP id c47so532983639qtc.2 for ; Thu, 05 Jan 2017 14:57:02 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=instaclustr-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=PWBm3j/RTLdLmeZjG42Cx3sKe8gw6sE3iGZ4ck8Bqak=; b=oLcGZ3JdXt/rNxV9rXbXJEyvsdyfssU2Omt7M/HLZ4d3Yppb3JP9q0lK9+mxrrXGXF qd8/jjMnt0RP/P7iFUT1WUc0yVTzWBNW2dR+a/NrhZsHV72gv9FQOkw64Xa/N8j09zGC 4sWIhWWiOCGV32XjVnOVoi/6b5fUjKumQoAzz7iVK1vU8J+zZVSwfVrAm/AqpEhHvz0a CFLMY8oHuQDWbLKAffiCECpH00R1mil/mv1r5BGUdz7DzXypQF62yoS9+CTCKmndxc+f dQM8qiv7DMMUqvY5pOvji3/wIa1XFki7v8rFkqOa3+JJi33wJpg/Q0pwn7rDkq5FCKdX 78Sw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=PWBm3j/RTLdLmeZjG42Cx3sKe8gw6sE3iGZ4ck8Bqak=; b=MtTubes2OiMdqzJyLr6VORp/bShp8mwy8viy030CC2oVi00Ly0Xz/9Hk2bBe9V82t/ GTpkpWc4oGmOcYxo5SZsbyPXIhn2+TF/TTlOv64FvNCdJtKrvurOP09+YLTFJSmkCGyw 8NXxxdfjr44UNPIdN1xZednoYCBPjyMY2GJg3HA1y9wtcuNPCJGjjYt4uQr8rl4yYp4R zozHMGiX8IN8NPgn865eWN+sRRJIHBdvQnNBWddU/XhNKR5UffmiH48PHY/dTwzKWoUr OymZVLOA6dRQbp0whbcd0Js4++0JUgaQoFZ9+kRLsQrC0D0/HyWVsHwNh6EW5dPSEZHM OTPA== X-Gm-Message-State: AIkVDXIHNZGyPDo1J++ryWGqZCWg4n/gzmwBHDpBlA38ElqGiINzYBC2kCJiNND9uY/l/mxQc4XUP1No3ACH8GuX X-Received: by 10.200.41.146 with SMTP id 18mr74348289qts.84.1483657021663; Thu, 05 Jan 2017 14:57:01 -0800 (PST) MIME-Version: 1.0 Received: by 10.12.153.207 with HTTP; Thu, 5 Jan 2017 14:57:01 -0800 (PST) Received: by 10.12.153.207 with HTTP; Thu, 5 Jan 2017 14:57:01 -0800 (PST) In-Reply-To: <985DA910-9BC1-4CB2-8544-EBC04333776F@cisco.com> References: <3673F312-60BE-41E3-BF94-BBCDDC5AC84B@cisco.com> <60F8F41B-B499-426C-9B6F-90C69C10A4AE@cisco.com> <985DA910-9BC1-4CB2-8544-EBC04333776F@cisco.com> From: kurt Greaves Date: Thu, 5 Jan 2017 22:57:01 +0000 Message-ID: Subject: Re: Cassandra cluster performance To: user@cassandra.apache.org Cc: Abhishek Kumar Maheshwari Content-Type: multipart/alternative; boundary=001a11404e0ebec998054560d151 archived-at: Thu, 05 Jan 2017 22:57:11 -0000 --001a11404e0ebec998054560d151 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable you should try switching to async writes and then perform the test. sync writes won't make much difference from a single node but multiple nodes there should be a massive difference. On 4 Jan 2017 10:05, "Branislav Janosik -T (bjanosik - AAP3 INC at Cisco)" = < bjanosik@cisco.com> wrote: > Hi, > > > > Our column family definition is > > > > *"CREATE TABLE onem2m.cse(" *+ > *"name TEXT PRIMARY KEY," *+ > *"resourceId TEXT," *+ > *")"*; > > *"CREATE TABLE IF NOT EXISTS onem2m.AeIdToResourceIdMapping(" *+ > *"cseBaseCseId TEXT," *+ > *"aeId TEXT," *+ > *"resourceId TEXT," *+ > *"PRIMARY KEY ((cseBaseCseId), aeId)" *+ > *")"*; > > > > *"CREATE TABLE IF NOT EXISTS onem2m.Resources_" *+ i + *"(" *+ > *"CONTENT_INSTANCE_OldestId TEXT," *+ > *"CONTENT_INSTANCE_LatestId TEXT," *+ > *"SUBSCRIPTION_OldestId TEXT," *+ > *"SUBSCRIPTION_LatestId TEXT," *+ > *"resourceId TEXT PRIMARY KEY," *+ > *"resourceType TEXT," *+ > *"resourceName TEXT," *+ > *"jsonContent TEXT," *+ > *"parentId TEXT," *+ > *")"*; > > *"CREATE TABLE IF NOT EXISTS onem2m.Children_" *+ i + *"(" *+ > *"parentResourceId TEXT," *+ > *"childName TEXT," *+ > *"childResourceId TEXT," *+ > *"nextId TEXT," *+ > *"prevId TEXT," *+ > *"PRIMARY KEY ((parentResourceId), childName)" *+ > *")"*; > > > > > > > > *From: *Abhishek Kumar Maheshwari > *Date: *Sunday, December 25, 2016 at 8:54 PM > *To: *"Branislav Janosik -T (bjanosik - AAP3 INC at Cisco)" < > bjanosik@cisco.com> > *Cc: *"user@cassandra.apache.org" > *Subject: *RE: Cassandra cluster performance > > > > Hi Branislav, > > > > > > What is your column family definition? > > > > > > *Thanks & Regards,* > *Abhishek Kumar Maheshwari* > *+91- 9999805591 <+91%2099998%2005591> (Mobile)* > > Times Internet Ltd. | A Times of India Group Company > > FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Branislav Janosik -T (bjanosik - AAP3 INC at Cisco) [mailto: > bjanosik@cisco.com] > *Sent:* Thursday, December 22, 2016 6:18 AM > *To:* user@cassandra.apache.org > *Subject:* Re: Cassandra cluster performance > > > > Hi, > > > > - Consistency level is set to ONE > > - Keyspace definition: > > *"CREATE KEYSPACE IF NOT EXISTS onem2m " *+ > *"WITH replication =3D " *+ > *"{ 'class' : 'SimpleStrategy', 'replication_factor' : 1}"*; > > > > - yes, the client is on separate VM > > - In our project we use Cassandra API version 3.0.2 but the database (clu= ster) is version 3.9 > > - for 2node cluster: > > first VM: 25 GB RAM, 16 CPUs > > second VM: 16 GB RAM, 16 CPUs > > > > > > > > *From: *Ben Slater > *Reply-To: *"user@cassandra.apache.org" > *Date: *Wednesday, December 21, 2016 at 2:32 PM > *To: *"user@cassandra.apache.org" > *Subject: *Re: Cassandra cluster performance > > > > You would expect some drop when moving to single multiple nodes but on th= e > face of it that feels extreme to me (although I=E2=80=99ve never personal= ly tested > the difference). Some questions that might help provide an answer: > > - what consistency level are you using for the test? > > - what is your keyspace definition (replication factor most importantly)? > > - where are you running your test client (is it a separate box to > cassandra)? > > - what C* version? > > - what are specs (CPU, RAM) of the test servers? > > > > Cheers > > Ben > > > > On Thu, 22 Dec 2016 at 09:26 Branislav Janosik -T (bjanosik - AAP3 INC at > Cisco) wrote: > > Hi all, > > > > I=E2=80=99m working on a project and we have Java benchmark test for test= ing the > performance when using Cassandra database. Create operation on a single > node Cassandra cluster is about 15K operations per second. Problem we hav= e > is when I set up cluster with 2 or more nodes (each of them are on separa= te > virtual machines and servers), the performance goes down to 1K ops/sec. I > follow the official instructions on how to set up a multinode cluster =E2= =80=93 the > only things I change in Cassandra.yaml file are: change seeds to IP addre= ss > of one node, change listen and rpc address to IP address of the node and > finally change endpoint snitch to GossipingPropertyFileSnitch. The > replication factor is set to 1 when having 2-node cluster. I use only one > datacenter. The cluster seems to be doing fine (I can see nodes > communicating) and so is the CPU, RAM usage on the machines. > > > > Does anybody have any ideas? Any help would be very appreciated. > > > > Thanks! > > > > A must visit exhibition for all Fitness and Sports Freaks. TOI Global > Sports Business Show from 21 to 23 December 2016 Bombay Exhibition Centre= , > Mumbai. Meet the legends Kaizzad Capadia, Bhaichung Bhutia and more. Join > the workshops on Boxing & Football and more. www.TOI-GSBS.com > --001a11404e0ebec998054560d151 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
you should try switching to async writes and then perform= the test. sync writes won't make much difference from a single node bu= t multiple nodes there should be a massive difference.=C2=A0

On 4 Jan 2017 10:05, "= ;Branislav Janosik -T (bjanosik - AAP3 INC at Cisco)" <bjanosik@cisco.com> wrote:

Hi,

=C2=A0

Our column family definition is

=C2=A0

"CREATE TABLE onem2m.cse("= ; +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "name TEXT PRIMARY KEY," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "resourceId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 ")";

"CREATE TABLE IF NOT EXISTS on= em2m.AeIdToResourceIdMapping(" +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "cseBaseCseId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "aeId TEXT," +<= br> =C2=A0 =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0"resourceId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "PRIMARY KEY ((cseBaseCseId), = aeId)" +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 ")";

=C2=A0

"CREATE TABLE IF NOT EXISTS on= em2m.Resources_" + = i + "(&qu= ot; +
=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0
"CONTENT_INSTANCE_OldestId TEX= T," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "CONTENT_INSTANCE_LatestId TEX= T," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "SUBSCRIPTION_OldestId TEXT,&q= uot; +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "SUBSCRIPTION_LatestId TEXT,&q= uot; +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "resourceId TEXT PRIMARY KEY,&= quot; +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "resourceType TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "resourceName TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "jsonContent TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "parentId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 ")";

&qu= ot;CREATE TABLE IF NOT EXISTS onem2m.Children_" + = i + "(&qu= ot; +
=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0
"parentResourceId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "childName TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "childResourceId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "nextId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "prevId TEXT," +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "PRIMARY KEY ((parentResourceI= d), childName)" +<= br> =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 ")";

=C2=A0

=C2=A0

=C2=A0

F= rom: Abhishek Kumar Maheshwa= ri <Abhishek.Maheshwari@timesinternet.in>
Date: Sunday, December 25, 2016 at 8:54 PM
To: "Branislav Janosik -T (bjanosik - AAP3 INC at Cisco)" = <bjanosik@cisco.= com>
Cc: "user@cassandra.apache.org" <user@cassandra.apache.org>
Subject: RE: Cassandra cluster performance

=C2=A0

Hi Branislav,

=C2=A0

=C2=A0

What is your column family definition?

=C2=A0

=C2=A0

Thanks & Regards,
Abhishek Kumar Maheshwari = +91-=C2=A09999805591=C2=A0(Mobile)

Times Internet Ltd. | A Times of India Group Company<= /u>

FC - 6, Sector 16A, Film City,=C2=A0 Noida,=C2=A0 U.P. 201301 |= INDIA

P Please do not print this email= unless it is absolutely necessary. Spread environmental awareness.

=C2=A0

From: B= ranislav Janosik -T (bjanosik - AAP3 INC at Cisco) [mailto:bjanosik@cisco.com]
Sent: Thursday, December 22, 2016 6:18 AM
To: u= ser@cassandra.apache.org
Subject: Re: Cassandra cluster performance

=C2=A0

Hi,

=C2=A0

- Consistency level is set to ONE

-=C2=A0 Keyspace definition:

=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=
=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 "CREATE KEYSPACE=C2=
=A0 IF NOT EXISTS=C2=A0 onem2m " +
=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2= =A0=C2=A0
"WITH replication =3D " +
=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0
"{ 'class' : 'SimpleStrategy', 'replicat= ion_factor' : 1}";
=C2=A0
- yes, the client is on separate VM=
- In our project we use Cassandra API version 3.0.2 but =
the database (cluster) is version 3.9
- for 2node cluster: 
=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 first V=
M: 25 GB RAM, 16 CPUs
=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 second =
VM: 16 GB RAM, 16 CPUs
=C2=A0

=C2=A0

=C2=A0

F= rom: Ben Slater <ben.slater@instacl= ustr.com>
Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org> Date: Wednesday, December 21, 2016 at 2:32 PM
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Subject: Re: Cassandra cluster performance

=C2=A0

You would expect some drop when moving to single mul= tiple nodes but on the face of it that feels extreme to me (although I=E2= =80=99ve never personally tested the difference). Some questions that might= help provide an answer:

- what consistency level are you using for the test?=

- what is your keyspace definition (replication fact= or most importantly)?

- where are you running your test client (is it a se= parate box to cassandra)?

- what C* version?

- what are specs (CPU, RAM) of the test servers?<= /u>

=C2=A0

Cheers

Ben

=C2=A0

On Thu, 22 Dec 2016 at 09:26 Branislav Janosik -T (b= janosik - AAP3 INC at Cisco) <bjanosik@cisco.com> wrote:

Hi all,

=C2=A0

I=E2=80=99m working on a project and we have Ja= va benchmark test for testing the performance when using Cassandra database= . Create operation on a single node Cassandra cluster is about 15K operations per s= econd. Problem we have is when I set up cluster with 2 or more nodes (each = of them are on separate virtual machines and servers), the performance goes= down to 1K ops/sec. I follow the official instructions on how to set up a multinode cluster =E2=80=93 the o= nly things I change in Cassandra.yaml file are: change seeds to IP address = of one node, change listen and rpc address to IP address of the node and fi= nally change endpoint snitch to GossipingPropertyFileSnitch. The replication factor is set to 1 when having 2-node cluster. I use only = one datacenter. The cluster seems to be doing fine (I can see nodes communi= cating) and so is the CPU, RAM usage on the machines.

=C2=A0

Does anybody have any ideas? Any help would be = very appreciated.

=C2=A0

Thanks!

=C2=A0

A must visit exhibition for all Fitness and Sports F= reaks. TOI Global Sports Business Show from 21 to 23 December 2016 Bombay E= xhibition Centre, Mumbai. Meet the legends Kaizzad Capadia, Bhaichung Bhuti= a and more. Join the workshops on Boxing & Football and more. www.TOI-GSBS.com

--001a11404e0ebec998054560d151--