Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8ABE02004F1 for ; Wed, 30 Aug 2017 17:47:05 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 892431694F2; Wed, 30 Aug 2017 15:47:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A8F931694E8 for ; Wed, 30 Aug 2017 17:47:04 +0200 (CEST) Received: (qmail 75142 invoked by uid 500); 30 Aug 2017 15:47:03 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 75131 invoked by uid 99); 30 Aug 2017 15:47:03 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Aug 2017 15:47:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 94C7F182B1F for ; Wed, 30 Aug 2017 15:47:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id 581ffqmbDbCM for ; Wed, 30 Aug 2017 15:47:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 5F32E5FD7D for ; Wed, 30 Aug 2017 15:47:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 96771E099C for ; Wed, 30 Aug 2017 15:47:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 3F8FB24147 for ; Wed, 30 Aug 2017 15:47:00 +0000 (UTC) Date: Wed, 30 Aug 2017 15:47:00 +0000 (UTC) From: "Romain Hardouin (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CASSANDRA-11363) High Blocked NTR When Connecting MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 30 Aug 2017 15:47:05 -0000 [ https://issues.apache.org/jira/browse/CASSANDRA-11363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16147483#comment-16147483 ] Romain Hardouin commented on CASSANDRA-11363: --------------------------------------------- [~sadagopan88] When using Open Source Apache Cassandra you have to specify it in {{cassandra-env.sh}}: {code} JVM_OPTS="$JVM_OPTS -Dcassandra.max_queued_native_transport_requests=1024" {code} I don't know if DSE set this setting to something different than default value (128). You can ask to DataStax. > High Blocked NTR When Connecting > -------------------------------- > > Key: CASSANDRA-11363 > URL: https://issues.apache.org/jira/browse/CASSANDRA-11363 > Project: Cassandra > Issue Type: Bug > Components: Coordination > Reporter: Russell Bradberry > Assignee: T Jake Luciani > Fix For: 2.1.16, 2.2.8, 3.0.10, 3.10 > > Attachments: cassandra-102-cms.stack, cassandra-102-g1gc.stack, max_queued_ntr_property.txt, thread-queue-2.1.txt > > > When upgrading from 2.1.9 to 2.1.13, we are witnessing an issue where the machine load increases to very high levels (> 120 on an 8 core machine) and native transport requests get blocked in tpstats. > I was able to reproduce this in both CMS and G1GC as well as on JVM 7 and 8. > The issue does not seem to affect the nodes running 2.1.9. > The issue seems to coincide with the number of connections OR the number of total requests being processed at a given time (as the latter increases with the former in our system) > Currently there is between 600 and 800 client connections on each machine and each machine is handling roughly 2000-3000 client requests per second. > Disabling the binary protocol fixes the issue for this node but isn't a viable option cluster-wide. > Here is the output from tpstats: > {code} > Pool Name Active Pending Completed Blocked All time blocked > MutationStage 0 8 8387821 0 0 > ReadStage 0 0 355860 0 0 > RequestResponseStage 0 7 2532457 0 0 > ReadRepairStage 0 0 150 0 0 > CounterMutationStage 32 104 897560 0 0 > MiscStage 0 0 0 0 0 > HintedHandoff 0 0 65 0 0 > GossipStage 0 0 2338 0 0 > CacheCleanupExecutor 0 0 0 0 0 > InternalResponseStage 0 0 0 0 0 > CommitLogArchiver 0 0 0 0 0 > CompactionExecutor 2 190 474 0 0 > ValidationExecutor 0 0 0 0 0 > MigrationStage 0 0 10 0 0 > AntiEntropyStage 0 0 0 0 0 > PendingRangeCalculator 0 0 310 0 0 > Sampler 0 0 0 0 0 > MemtableFlushWriter 1 10 94 0 0 > MemtablePostFlush 1 34 257 0 0 > MemtableReclaimMemory 0 0 94 0 0 > Native-Transport-Requests 128 156 387957 16 278451 > Message type Dropped > READ 0 > RANGE_SLICE 0 > _TRACE 0 > MUTATION 0 > COUNTER_MUTATION 0 > BINARY 0 > REQUEST_RESPONSE 0 > PAGED_RANGE 0 > READ_REPAIR 0 > {code} > Attached is the jstack output for both CMS and G1GC. > Flight recordings are here: > https://s3.amazonaws.com/simple-logs/cassandra-102-cms.jfr > https://s3.amazonaws.com/simple-logs/cassandra-102-g1gc.jfr > It is interesting to note that while the flight recording was taking place, the load on the machine went back to healthy, and when the flight recording finished the load went back to > 100. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org For additional commands, e-mail: commits-help@cassandra.apache.org