From: "Roger Fischer (CW)"
To: "user@ignite.apache.org"
Subject: Activation: slow and: Ignite node crashed in the middle of checkpoint.
Date: Fri, 11 Aug 2017 21:38:43 +0000

Hello,

I am wondering if the following behavior is typical, or if it represents a concern.

I have a 3-node cluster with native persistence. Each node has 4 CPUs and 16 GB of RAM.
Each node has ~45 GB used in work/db. Total across the 3 nodes is about 36.5 M objects.
I am using SQL queries, and there are 3 indexes.

The servers start up normally and join the cluster, as expected.

When I start the client, which calls active(), all 3 servers report the following:

[12:41:28] Topology snapshot [ver=5, servers=3, clients=1, CPUs=16, heap=4.8GB]
[12:41:29] Default checkpoint page buffer size is too small, setting to an adjusted value: 2.0 GiB
[12:41:29] Ignite node crashed in the middle of checkpoint. Will restore memory state and enforce checkpoint on node start.

1) Should I worry about the "crashed" log?

The activation takes more than 30 minutes (until active() returns).

2) Is it normal for activation to take that long?

ver. 2.1.0#20170720-sha1:a6ca5c8a
OS: Linux 3.10.0-514.el7.x86_64 amd64

Thanks...

Roger