Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 9DA64200AE3 for ; Wed, 4 May 2016 13:47:01 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 9C3CF1601A3; Wed, 4 May 2016 11:47:01 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E6F0D1601A2 for ; Wed, 4 May 2016 13:47:00 +0200 (CEST) Received: (qmail 84226 invoked by uid 500); 4 May 2016 11:47:00 -0000 Mailing-List: contact user-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@ignite.apache.org Delivered-To: mailing list user@ignite.apache.org Received: (qmail 84215 invoked by uid 99); 4 May 2016 11:47:00 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 May 2016 11:47:00 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id B6DED18009B for ; Wed, 4 May 2016 11:46:59 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.972 X-Spam-Level: X-Spam-Status: No, score=0.972 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_NONE=-0.0001, SPF_SOFTFAIL=0.972] autolearn=disabled Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id n3fdb9su0HCI for ; Wed, 4 May 2016 11:46:57 +0000 (UTC) Received: from mbob.nabble.com (mbob.nabble.com [162.253.133.15]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTP id E084A5F24A for ; Wed, 4 May 2016 11:46:56 +0000 (UTC) Received: from malf.nabble.com (unknown [162.253.133.59]) by mbob.nabble.com (Postfix) with ESMTP id 8883726C61D6 for ; Wed, 4 May 2016 04:31:53 -0700 (PDT) Date: Wed, 4 May 2016 04:32:24 -0700 (PDT) From: Denis Magda To: user@ignite.apache.org Message-ID: <1462361544380-4757.post@n6.nabble.com> In-Reply-To: <1462348109910-4752.post@n6.nabble.com> References: <1462348109910-4752.post@n6.nabble.com> Subject: Re: Stopping the node in order to prevent cluster wide instability. MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable archived-at: Wed, 04 May 2016 11:47:01 -0000 Hi,=20 Please properly subscribe to the user list (this way we will not have to manually approve your emails). All you need to do is send an email to =C3= =AC user-subscribe@ignite.apache.org=C3=AE and follow simple instructions in th= e reply. Upon your questions. Most likely the node, that was stopped, became segmented: was kicked out of the topology by some reason. Look for "local node segmented" message is the log. More often the reason is either is a slow network connection when a node can't reply on a message during IgniteConfiguration.failureDetectionTimeout or a long GC pauses. I would suggest that there are no long GC pauses first. Refer to this page for more details on how to gather GC logs [1]. If you see pauses bigger tha= n 10 secs (default value of IgniteConfiguration.failureDetectionTimeout) then this is the reason why the node was segmented and you have to tune Java hea= p [2] and/or your app. [1] https://apacheignite.readme.io/v1.5/docs/jvm-and-system-tuning#section-deta= iled-garbage-collection-stats [2] https://apacheignite.readme.io/v1.5/docs/jvm-and-system-tuning#jvm-tuning-f= or-clusters-with-on_heap-caches -- Denis -- View this message in context: http://apache-ignite-users.70518.x6.nabble.co= m/Stopping-the-node-in-order-to-prevent-cluster-wide-instability-tp4752p475= 7.html Sent from the Apache Ignite Users mailing list archive at Nabble.com.