From issues-return-96108-archive-asf-public=cust-asf.ponee.io@ignite.apache.org Sun May 19 07:24:09 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 51D07180663 for ; Sun, 19 May 2019 09:24:09 +0200 (CEST) Received: (qmail 35406 invoked by uid 500); 19 May 2019 07:24:05 -0000 Mailing-List: contact issues-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.apache.org Delivered-To: mailing list issues@ignite.apache.org Received: (qmail 35393 invoked by uid 99); 19 May 2019 07:24:05 -0000 Received: from mailrelay1-us-west.apache.org (HELO mailrelay1-us-west.apache.org) (209.188.14.139) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 19 May 2019 07:24:05 +0000 Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 47258E00C8 for ; Sun, 19 May 2019 07:24:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id CA538256C1 for ; Sun, 19 May 2019 07:24:00 +0000 (UTC) Date: Sun, 19 May 2019 07:24:00 +0000 (UTC) From: "Dmitriy Govorukhin (JIRA)" To: issues@ignite.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (IGNITE-11425) Log information about inaccessible nodes through Communication MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/IGNITE-11425?page=3Dcom.atlassi= an.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D16= 843340#comment-16843340 ]=20 Dmitriy Govorukhin commented on IGNITE-11425: --------------------------------------------- [~Denis Chudov] Thanks for the contribution, merged to master. > Log information about inaccessible nodes through Communication > -------------------------------------------------------------- > > Key: IGNITE-11425 > URL: https://issues.apache.org/jira/browse/IGNITE-11425 > Project: Ignite > Issue Type: Improvement > Reporter: Vladislav Pyatkov > Assignee: Denis Chudov > Priority: Major > Fix For: 2.8 > > Time Spent: 10m > Remaining Estimate: 0h > > In case of long getting communication TCP client (longe than this CONNECT= ION_ESTABLISH_THRESHOLD_MS =3D 100) message will printed: > {noformat} > [sys-#20167%dht.CacheGetReadFromBackupFailoverTest0%][TcpCommunicationSpi= ] TCP client created [client=3DGridTcpNioCommunicationClient [ses=3DGridSel= ectorNioSessionImpl [worker=3DDirectNioClientWorker [super=3DAbstractNioCli= entWorker [idx=3D3, bytesRcvd=3D0, bytesSent=3D0, bytesRcvd0=3D0, bytesSent= 0=3D0, select=3Dtrue, super=3DGridWorker [name=3Dgrid-nio-worker-tcp-comm-3= , igniteInstanceName=3Ddht.CacheGetReadFromBackupFailoverTest0, finished=3D= false, heartbeatTs=3D1550512236151, hashCode=3D140561231, interrupted=3Dfal= se, runner=3Dgrid-nio-worker-tcp-comm-3-#20147%dht.CacheGetReadFromBackupFa= iloverTest0%]]], writeBuf=3Djava.nio.DirectByteBuffer[pos=3D0 lim=3D32768 c= ap=3D32768], readBuf=3Djava.nio.DirectByteBuffer[pos=3D0 lim=3D32768 cap=3D= 32768], inRecovery=3DGridNioRecoveryDescriptor [acked=3D0, resendCnt=3D0, r= cvCnt=3D0, sentCnt=3D0, reserved=3Dtrue, lastAck=3D0, nodeLeft=3Dfalse, nod= e=3DTcpDiscoveryNode [id=3D8a660330-6ddb-4031-b955-4cb4f4b00002, addrs=3DAr= rayList [127.0.0.1], sockAddrs=3DHashSet [/127.0.0.1:47502], discPort=3D475= 02, order=3D5, intOrder=3D4, lastExchangeTime=3D1550512235890, loc=3Dfalse,= ver=3D2.8.0#20190218-sha1:29232e37, isClient=3Dfalse], connected=3Dfalse, = connectCnt=3D2, queueLimit=3D4096, reserveCnt=3D2, pairedConnections=3Dfals= e], outRecovery=3DGridNioRecoveryDescriptor [acked=3D0, resendCnt=3D0, rcvC= nt=3D0, sentCnt=3D0, reserved=3Dtrue, lastAck=3D0, nodeLeft=3Dfalse, node= =3DTcpDiscoveryNode [id=3D8a660330-6ddb-4031-b955-4cb4f4b00002, addrs=3DArr= ayList [127.0.0.1], sockAddrs=3DHashSet [/127.0.0.1:47502], discPort=3D4750= 2, order=3D5, intOrder=3D4, lastExchangeTime=3D1550512235890, loc=3Dfalse, = ver=3D2.8.0#20190218-sha1:29232e37, isClient=3Dfalse], connected=3Dfalse, c= onnectCnt=3D2, queueLimit=3D4096, reserveCnt=3D2, pairedConnections=3Dfalse= ], super=3DGridNioSessionImpl [locAddr=3D/127.0.0.1:38770, rmtAddr=3D/127.0= .0.1:45212, createTime=3D1550512236151, closeTime=3D0, bytesSent=3D0, bytes= Rcvd=3D0, bytesSent0=3D0, bytesRcvd0=3D0, sndSchedTime=3D1550512236151, las= tSndTime=3D1550512236151, lastRcvTime=3D1550512236151, readsPaused=3Dfalse,= filterChain=3DFilterChain[filters=3D[GridNioCodecFilter [parser=3Dorg.apac= he.ignite.internal.util.nio.GridDirectParser@d240a48, directMode=3Dtrue], G= ridConnectionBytesVerifyFilter], accepted=3Dfalse, markedForClose=3Dfalse]]= , super=3DGridAbstractCommunicationClient [lastUsed=3D1550512236151, closed= =3Dfalse, connIdx=3D0]], duration=3D211ms] > {noformat} > but in some cases we can not to get client during time out, and the messa= ge reduce to > {noformat} > TCP client created [client=3Dnull, duration=3D60004 ms] > {noformat} > According to the message you cannot understand which nodes were inaccessi= ble. > Moreover, wants to see the connection trouble earlier than the 10 minutes= after. > Should to log ip/host for clear understanding what was the node and log W= ARN message each time when need to increase timeout: > {code} > if (lastWaitingTimeout < 60000) > lastWaitingTimeout *=3D 2; > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)