Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A2144200B65 for ; Wed, 3 Aug 2016 06:13:30 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A0A37160AA8; Wed, 3 Aug 2016 04:13:30 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 197A4160A76 for ; Wed, 3 Aug 2016 06:13:29 +0200 (CEST) Received: (qmail 26719 invoked by uid 500); 3 Aug 2016 04:13:29 -0000 Mailing-List: contact user-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@ignite.apache.org Delivered-To: mailing list user@ignite.apache.org Received: (qmail 26709 invoked by uid 99); 3 Aug 2016 04:13:29 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Aug 2016 04:13:29 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id CFCB0C0244 for ; Wed, 3 Aug 2016 04:13:28 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.972 X-Spam-Level: X-Spam-Status: No, score=0.972 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_NONE=-0.0001, SPF_SOFTFAIL=0.972] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id FcrY6fWWzaQb for ; Wed, 3 Aug 2016 04:13:27 +0000 (UTC) Received: from mbob.nabble.com (mbob.nabble.com [162.253.133.15]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 6EC055F306 for ; Wed, 3 Aug 2016 04:13:26 +0000 (UTC) Received: from malf.nabble.com (unknown [162.253.133.59]) by mbob.nabble.com (Postfix) with ESMTP id 45FF42DEC1FA for ; Tue, 2 Aug 2016 20:48:25 -0700 (PDT) Date: Tue, 2 Aug 2016 20:55:42 -0700 (PDT) From: Jason To: user@ignite.apache.org Message-ID: <1470196542634-6691.post@n6.nabble.com> In-Reply-To: <1469835383440-6633.post@n6.nabble.com> References: <1468394674256-6252.post@n6.nabble.com> <1468447383439-6280.post@n6.nabble.com> <1469808299059-6624.post@n6.nabble.com> <1469835383440-6633.post@n6.nabble.com> Subject: Re: Failed to wait for initial partition map exchange MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit archived-at: Wed, 03 Aug 2016 04:13:30 -0000 hi Val, seems that when there's assertion or OOM in one node, it doesn't exit, right? so it still sends heartbeat to others, then the whole cluster are waiting for it to recover (hanging). Is there any easy way to let one node restart immediately when encounter unrecoverable errors, like OOM or severe assertion? Thanks, -Jason -- View this message in context: http://apache-ignite-users.70518.x6.nabble.com/Failed-to-wait-for-initial-partition-map-exchange-tp6252p6691.html Sent from the Apache Ignite Users mailing list archive at Nabble.com.