From: Evgenii Zhuravlev
Date: Wed, 15 Apr 2020 07:35:53 -0700
Subject: Re: org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi - Failed to reconnect to cluster (will retry): class o.a.i.IgniteCheckedException: Failed to deserialize object with given class loader: org.springframework.boot.loader.LaunchedURLClassLoader
To: user
Reply-To: user@ignite.apache.org

Hi,

Please provide logs not only from the server node, but from the client node too. You mentioned that only one client has this problem, so please provide the full log from this node.

Also, you said that you set non-default timeouts for clients while the server node still uses default values. I wouldn't recommend doing this; timeouts should be the same for all nodes in the cluster.
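
One way to check whether two nodes really run with the same timeouts is to compare the IgniteConfiguration line that each node prints at startup. The following is only a minimal sketch under stated assumptions: `TimeoutDiff` is a hypothetical helper, not part of Ignite, and the client values in `main` are invented for illustration because the thread does not state them. It treats the dump as flat `name=value` text, so a name that appears in several nested sections is taken from its first occurrence only.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper (not an Ignite class): compares timeout values found in config dumps. */
class TimeoutDiff {
    // Matches "name=123" pairs whose name contains "Timeout" or "timeout",
    // e.g. netTimeout=5000, sockTimeout=0, failureDetectionTimeout=10000.
    private static final Pattern TIMEOUT =
        Pattern.compile("\\b(\\w*[Tt]imeout\\w*)=(-?\\d+)");

    /** Extracts timeout-like properties from one dump; the first occurrence of a name wins. */
    static Map<String, Long> extract(String configDump) {
        Map<String, Long> out = new LinkedHashMap<>();
        Matcher m = TIMEOUT.matcher(configDump);
        while (m.find()) {
            out.putIfAbsent(m.group(1), Long.parseLong(m.group(2)));
        }
        return out;
    }

    /** Returns the sorted names whose values differ or that appear in only one dump. */
    static Set<String> differing(String serverDump, String clientDump) {
        Map<String, Long> server = extract(serverDump);
        Map<String, Long> client = extract(clientDump);
        Set<String> names = new TreeSet<>(server.keySet());
        names.addAll(client.keySet());
        Set<String> diff = new TreeSet<>();
        for (String name : names) {
            if (!Objects.equals(server.get(name), client.get(name))) {
                diff.add(name);
            }
        }
        return diff;
    }

    public static void main(String[] args) {
        // Server values are taken from the configuration dump quoted in this thread.
        String server = "netTimeout=5000, sockTimeout=0, failureDetectionTimeout=10000";
        // Client values are made up for this example.
        String client = "netTimeout=60000, sockTimeout=60000, failureDetectionTimeout=10000";
        System.out.println(differing(server, client)); // prints [netTimeout, sockTimeout]
    }
}
```

Any name reported by `differing` is a candidate for alignment between the client and the server. A value of 0 in the dump may simply mean the property was not set explicitly, so treat the result as a starting point for review rather than as proof of a misconfiguration.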

Evgenii

On Wed, Apr 15, 2020 at 03:04, Rajan Ahlawat <rajan.ahlawat@gmail.com> wrote:
Shared file with email-id:
e.zhuravlev.wk@gmail.com

We have a single instance of Ignite. The file contains the full log for Mar
30, 2019. Line 6429 is the first occurrence of the incident.

On Tue, Apr 14, 2020 at 8:27 PM Evgenii Zhuravlev
<e.zhuravlev.wk@gmail.com> wrote:
>
> Can you provide full log files from all nodes? It's impossible to find the root cause from this.
>
> Evgenii
>
> On Tue, Apr 14, 2020 at 07:49, Rajan Ahlawat <rajan.ahlawat@gmail.com> wrote:
>>
>> server starts with following configuration:
>>
>> ignite_application-1-2020-03-17.log:14:[2020-03-17T08:23:33,664][INFO
>> ][main][IgniteKernal%igniteStart] IgniteConfiguration
>> [igniteInstanceName=igniteStart, pubPoolSize=32, svcPoolSize=32,
>> callbackPoolSize=32, stripedPoolSize=32, sysPoolSize=30,
>> mgmtPoolSize=4, igfsPoolSize=32, dataStreamerPoolSize=32,
>> utilityCachePoolSize=32, utilityCacheKeepAliveTime=60000,
>> p2pPoolSize=2, qryPoolSize=32,
>> igniteHome=/home/patrochandan01/ignite/apache-ignite-fabric-2.6.0-bin,
>> igniteWorkDir=/home/patrochandan01/ignite/apache-ignite-fabric-2.6.0-bin/work,
>> mbeanSrv=com.sun.jmx.mbeanserver.JmxMBeanServer@6f94fa3e,
>> nodeId=53396cb7-1b66-43da-bf10-ebb5f7cc9693,
>> marsh=org.apache.ignite.internal.binary.BinaryMarshaller@42b3b079,
>> marshLocJobs=false, daemon=false, p2pEnabled=false, netTimeout=5000,
>> sndRetryDelay=1000, sndRetryCnt=3, metricsHistSize=10000,
>> metricsUpdateFreq=2000, metricsExpTime=9223372036854775807,
>> discoSpi=TcpDiscoverySpi [addrRslvr=null, sockTimeout=0, ackTimeout=0,
>> marsh=null, reconCnt=100, reconDelay=10000, maxAckTimeout=600000,
>> forceSrvMode=false, clientReconnectDisabled=false, internalLsnr=null],
>> segPlc=STOP, segResolveAttempts=2, waitForSegOnStart=true,
>> allResolversPassReq=true, segChkFreq=10000,
>> commSpi=TcpCommunicationSpi [connectGate=null, connPlc=null,
>> enableForcibleNodeKill=false, enableTroubleshootingLog=false,
>> srvLsnr=org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi$2@6692b6c6,
>> locAddr=null, locHost=null, locPort=47100, locPortRange=100,
>> shmemPort=-1, directBuf=true, directSndBuf=false,
>> idleConnTimeout=600000, connTimeout=5000, maxConnTimeout=600000,
>> reconCnt=10, sockSndBuf=32768, sockRcvBuf=32768, msgQueueLimit=1024,
>> slowClientQueueLimit=1000, nioSrvr=null, shmemSrv=null,
>> usePairedConnections=false, connectionsPerNode=1, tcpNoDelay=true,
>> filterReachableAddresses=false, ackSndThreshold=32,
>> unackedMsgsBufSize=0, sockWriteTimeout=2000, lsnr=null,
>> boundTcpPort=-1, boundTcpShmemPort=-1, selectorsCnt=16,
>> selectorSpins=0, addrRslvr=null,
>> ctxInitLatch=java.util.concurrent.CountDownLatch@1cd629b3[Count = 1],
>> stopping=false,
>> metricsLsnr=org.apache.ignite.spi.communication.tcp.TcpCommunicationMetricsListener@589da3f3],
>> evtSpi=org.apache.ignite.spi.eventstorage.NoopEventStorageSpi@39d76cb5,
>> colSpi=NoopCollisionSpi [], deploySpi=LocalDeploymentSpi [lsnr=null],
>> indexingSpi=org.apache.ignite.spi.indexing.noop.NoopIndexingSpi@1cb346ea,
>> addrRslvr=null, clientMode=false, rebalanceThreadPoolSize=1,
>> txCfg=org.apache.ignite.configuration.TransactionConfiguration@4c012563,
>> cacheSanityCheckEnabled=true, discoStartupDelay=60000,
>> deployMode=SHARED, p2pMissedCacheSize=100, locHost=null,
>> timeSrvPortBase=31100, timeSrvPortRange=100,
>> failureDetectionTimeout=10000, clientFailureDetectionTimeout=30000,
>> metricsLogFreq=60000, hadoopCfg=null,
>> connectorCfg=org.apache.ignite.configuration.ConnectorConfiguration@14a50707,
>> odbcCfg=null, warmupClos=null, atomicCfg=AtomicConfiguration
>> [seqReserveSize=1000, cacheMode=PARTITIONED, backups=1, aff=null,
>> grpName=null], classLdr=null, sslCtxFactory=null, platformCfg=null,
>> binaryCfg=null, memCfg=null, pstCfg=null,
>> dsCfg=DataStorageConfiguration [sysRegionInitSize=41943040,
>> sysCacheMaxSize=104857600, pageSize=0, concLvl=25,
>> dfltDataRegConf=DataRegionConfiguration [name=Default_Region,
>> maxSize=20971520, initSize=15728640, swapPath=null,
>> pageEvictionMode=RANDOM_2_LRU, evictionThreshold=0.9,
>> emptyPagesPoolSize=100, metricsEnabled=false,
>> metricsSubIntervalCount=5, metricsRateTimeInterval=60000,
>> persistenceEnabled=false, checkpointPageBufSize=0], storagePath=null,
>> checkpointFreq=180000, lockWaitTime=10000, checkpointThreads=4,
>> checkpointWriteOrder=SEQUENTIAL, walHistSize=20, walSegments=10,
>> walSegmentSize=67108864, walPath=db/wal,
>> walArchivePath=db/wal/archive, metricsEnabled=false, walMode=LOG_ONLY,
>> walTlbSize=131072, walBuffSize=0, walFlushFreq=2000,
>> walFsyncDelay=1000, walRecordIterBuffSize=67108864,
>> alwaysWriteFullPages=false,
>> fileIOFactory=org.apache.ignite.internal.processors.cache.persistence.file.AsyncFileIOFactory@4bd31064,
>> metricsSubIntervalCnt=5, metricsRateTimeInterval=60000,
>> walAutoArchiveAfterInactivity=-1, writeThrottlingEnabled=false,
>> walCompactionEnabled=false], activeOnStart=true, autoActivation=true,
>> longQryWarnTimeout=3000, sqlConnCfg=null,
>> cliConnCfg=ClientConnectorConfiguration [host=null, port=10800,
>> portRange=100, sockSndBufSize=0, sockRcvBufSize=0, tcpNoDelay=true,
>> maxOpenCursorsPerConn=128, threadPoolSize=32, idleTimeout=0,
>> jdbcEnabled=true, odbcEnabled=true, thinCliEnabled=true,
>> sslEnabled=false, useIgniteSslCtxFactory=true, sslClientAuth=false,
>> sslCtxFactory=null], authEnabled=false, failureHnd=null,
>> commFailureRslvr=null]
>>
>>
>>
>> and error while connecting client:
>>
>> [2020-04-14T09:41:33,547][WARN
>> ][grid-timeout-worker-#71%igniteStart%][TcpDiscoverySpi] Socket write
>> has timed out (consider increasing 'sockTimeout' configuration
>> property) [sockTimeout=5000, rmtAddr=/10.80.104.224:51856,
>> rmtPort=51856, sockTimeout=5000]
>>
>> In the server configuration we didn't define any socketTimeout; the server,
>> not the client, might be throwing the socket timeout. But it occurs for only
>> one particular client and this server. Other web applications are able
>> to connect to the same server in our production environment.
>>
>> Thanks
>>
>> On Mon, Apr 13, 2020 at 8:09 PM Evgenii Zhuravlev
>> <e.zhuravlev.wk@gmail.com> wrote:
>> >
>> > Hi,
>> >
>> > Can you share full logs from all nodes? I mean log files, not the console output.
>> >
>> > Evgenii
>> >
>> > On Sun, Apr 12, 2020 at 20:30, Rajan Ahlawat <rajan.ahlawat@gmail.com> wrote:
>> >>
>> >> ?
>> >>
>> >> On Thu, Apr 9, 2020 at 3:11 AM Rajan Ahlawat <rajan.ahlawat@gmail.com> wrote:
>> >> >
>> >> > ---------- Forwarded message ---------
>> >> > From: Rajan Ahlawat <rajan.ahlawat@gmail.com>
>> >> > Date: Thu, Apr 9, 2020 at 3:09 AM
>> >> > Subject: org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi - Failed
>> >> > to reconnect to cluster (will retry): class
>> >> > o.a.i.IgniteCheckedException: Failed to deserialize object with given
>> >> > class loader: org.springframework.boot.loader.LaunchedURLClassLoader
>> >> > To: <user@ignite.apache.org>
>> >> >
>> >> >
>> >> > Hi
>> >> >
>> >> > We suddenly started getting the following exception on the client side after
>> >> > the node running the application was restarted:
>> >> >
>> >> > org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi - Failed to
>> >> > reconnect to cluster (will retry): class o.a.i.IgniteCheckedException:
>> >> > Failed to deserialize object with given class loader:
>> >> > org.springframework.boot.loader.LaunchedURLClassLoader
>> >> >
>> >> > I see a similar bug was raised here for version 2.7.0:
>> >> > https://issues.apache.org/jira/browse/IGNITE-11730
>> >> >
>> >> > We are currently using version 2.6.0.
>> >> > Following is our TcpDiscoverySpi configuration:
>> >> >
>> >> > private void setDiscoverySpiConfig(IgniteConfiguration cfg) {
>> >> >     TcpDiscoverySpi discoverySpi = new TcpDiscoverySpi();
>> >> >
>> >> >     setIpFinder(discoverySpi);
>> >> >     discoverySpi.setNetworkTimeout(platformCachingConfiguration.getIgnite().getSocketTimeout());
>> >> >     discoverySpi.setSocketTimeout(platformCachingConfiguration.getIgnite().getSocketTimeout());
>> >> >     discoverySpi.setJoinTimeout(platformCachingConfiguration.getIgnite().getJoinTimeout());
>> >> >     discoverySpi.setClientReconnectDisabled(platformCachingConfiguration.getIgnite().isClientReconnectDisabled());
>> >> >     discoverySpi.setReconnectCount(platformCachingConfiguration.getIgnite().getReconnectCount());
>> >> >     discoverySpi.setReconnectDelay(platformCachingConfiguration.getIgnite().getReconnectDelay());
>> >> >
>> >> >     cfg.setDiscoverySpi(discoverySpi);
>> >> > }
>> >> >
>> >> > Its IP finder config is:
>> >> >
>> >> > private void setTcpIpFinder(TcpDiscoverySpi discoverySpi) {
>> >> >     TcpDiscoveryVmIpFinder ipFinder = new TcpDiscoveryVmIpFinder();
>> >> >
>> >> >     ipFinder.setAddresses(platformCachingConfiguration.getIgnite().getNodes());
>> >> >     discoverySpi.setIpFinder(ipFinder);
>> >> > }
>> >> >
>> >> > We have tried every combination of timeouts; right now the timeouts are
>> >> > set to a very high value.
>> >> >
>> >> > (1) Could we be hitting the same bug reported for version 2.7.0? The bug
>> >> > description says it occurs on the server side, but we are getting the exact same
>> >> > stack trace in ClientImpl.java on the client side.
>> >> > (2) Assuming it is the same issue, is there a way to disable the data bag
>> >> > compression check, since upgrading both the client and server versions
>> >> > would not be possible immediately?
>> >> >
>> >> > Thanks in advance.