[ https://issues.apache.org/jira/browse/IGNITE-11968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16881038#comment-16881038
]
Stanislav Lukyanov commented on IGNITE-11968:
---------------------------------------------
This may or may not be fixed by IGNITE-11865. Need to reproduce and verify the fix.
> FailureProcessor reports tcp-comm-worker as blocked when resolving communication errors
in ZK
> ---------------------------------------------------------------------------------------------
>
> Key: IGNITE-11968
> URL: https://issues.apache.org/jira/browse/IGNITE-11968
> Project: Ignite
> Issue Type: Improvement
> Affects Versions: 2.7
> Reporter: Stanislav Lukyanov
> Priority: Major
>
> When multiple clients fail (stopped) at the same time and ZK discovery is used FailureProcessor
treats tcp-comm-worker as blocked with the following trace:
> {code}
> Thread [name="tcp-comm-worker-#1", id=122, state=WAITING, blockCnt=1465, waitCnt=4557]
> at sun.misc.Unsafe.park(Native Method)
> at java.util.concurrent.locks.LockSupport.park(LockSupport.java:304)
> at o.a.i.i.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:178)
> at o.a.i.i.util.future.GridFutureAdapter.get(GridFutureAdapter.java:141)
> at o.a.i.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.resolveCommunicationError(ZookeeperDiscoveryImpl.java:359)
> at o.a.i.spi.discovery.zk.ZookeeperDiscoverySpi.resolveCommunicationFailure(ZookeeperDiscoverySpi.java:259)
> at o.a.i.i.managers.discovery.GridDiscoveryManager.resolveCommunicationError(GridDiscoveryManager.java:2549)
> at o.a.i.i.managers.GridManagerAdapter$1.resolveCommunicationFailure(GridManagerAdapter.java:626)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi.processClientCreationError(TcpCommunicationSpi.java:3530)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi.createTcpClient(TcpCommunicationSpi.java:3483)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi.createNioClient(TcpCommunicationSpi.java:2987)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi.reserveClient(TcpCommunicationSpi.java:2870)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi.access$6000(TcpCommunicationSpi.java:271)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi$CommunicationWorker.processDisconnect(TcpCommunicationSpi.java:4489)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi$CommunicationWorker.body(TcpCommunicationSpi.java:4294)
> at o.a.i.i.util.worker.GridWorker.run(GridWorker.java:120)
> at o.a.i.spi.communication.tcp.TcpCommunicationSpi$5.body(TcpCommunicationSpi.java:2237)
> at o.a.i.spi.IgniteSpiThread.run(IgniteSpiThread.java:62)
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
|