ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ksenia Rybakova (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (IGNITE-3606) Node sometimes fails to detect broken connection
Date Tue, 01 Aug 2017 11:40:00 GMT

     [ https://issues.apache.org/jira/browse/IGNITE-3606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ksenia Rybakova updated IGNITE-3606:
------------------------------------
    Attachment: logs.zip

> Node sometimes fails to detect broken connection
> ------------------------------------------------
>
>                 Key: IGNITE-3606
>                 URL: https://issues.apache.org/jira/browse/IGNITE-3606
>             Project: Ignite
>          Issue Type: Bug
>          Components: general
>            Reporter: Semen Boikov
>            Priority: Critical
>             Fix For: 2.2
>
>         Attachments: logs.zip
>
>
> Here is test reproducing issue https://github.com/rossdanderson/IgniteDeadlock.
> When I run this test observe this sequence:
> - server starts
> - client starts
> - server sends 2000 messages to client, on client node communication backpressure pauses
reads
> - server gets write timeout and closes socket
> - for some reason client does not detect that existing connection was broken and thinks
that connection is still established (most probably because reads are paused and node does
not try to access connection)
> - when server tries to re-connec, client sees that connection already established and
rejects connection, so server constantly tries to reconnect and does not exit from reconnect
loop:
> {noformat}
> "main" prio=6 tid=0x0000000001f4a000 nid=0x3588 waiting on condition [0x00000000021ed000]
>    java.lang.Thread.State: TIMED_WAITING (sleeping)
> 	at java.lang.Thread.sleep(Native Method)
> 	at org.apache.ignite.internal.util.IgniteUtils.sleep(IgniteUtils.java:7414)
> 	at org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi.reserveClient(TcpCommunicationSpi.java:2055)
> 	at org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi.sendMessage0(TcpCommunicationSpi.java:1970)
> 	at org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi.sendMessage(TcpCommunicationSpi.java:1936)
> 	at org.apache.ignite.internal.managers.communication.GridIoManager.send(GridIoManager.java:1304)
> 	at org.apache.ignite.internal.managers.communication.GridIoManager.sendOrderedMessage(GridIoManager.java:1540)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message