ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mikhail Cherkasov (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (IGNITE-9184) Cluster hangs during concurrent node restart and continues query registration
Date Mon, 06 Aug 2018 13:00:00 GMT

    [ https://issues.apache.org/jira/browse/IGNITE-9184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16570181#comment-16570181
] 

Mikhail Cherkasov commented on IGNITE-9184:
-------------------------------------------

Stacktraces shows that client and server node are stuck on initial exchange:

 

 
{code:java}
"Thread-0" #10 prio=5 os_prio=31 tid=0x00007fd2a28e7800 nid=0x4003 waiting on condition [0x00007000051c8000]
 java.lang.Thread.State: TIMED_WAITING (parking)
 at sun.misc.Unsafe.park(Native Method)
 at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:338)
 at org.apache.ignite.internal.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:217)
 at org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:159)
 at org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:151)
 at org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager.onKernalStart(GridCachePartitionExchangeManager.java:632)
 at org.apache.ignite.internal.processors.cache.GridCacheProcessor.onKernalStart(GridCacheProcessor.java:865)
 at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1043)
 at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1973)
 at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1716)
 - locked <0x00000006d9f1e598> (a org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
 at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1144)
 at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:664)
 at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:589)
 at org.apache.ignite.Ignition.start(Ignition.java:327)
 at continues_query.StressTest$StartStopTask.run(StressTest.java:201)
 at java.lang.Thread.run(Thread.java:748)

"main" #1 prio=5 os_prio=31 tid=0x00007fd2a3800800 nid=0x2503 waiting on condition [0x0000700004218000]
 java.lang.Thread.State: TIMED_WAITING (parking)
 at sun.misc.Unsafe.park(Native Method)
 at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:338)
 at org.apache.ignite.internal.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:217)
 at org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:159)
 at org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:151)
 at org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager.onKernalStart(GridCachePartitionExchangeManager.java:632)
 at org.apache.ignite.internal.processors.cache.GridCacheProcessor.onKernalStart(GridCacheProcessor.java:865)
 at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1043)
 at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1973)
 at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1716)
 - locked <0x00000006de0aa3b0> (a org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
 at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1144)
 at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:664)
 at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:589)
 at org.apache.ignite.Ignition.start(Ignition.java:327)
 at continues_query.StressTest.main(StressTest.java:81)
{code}
 

I don't see any other stuck threads.

 

> Cluster hangs during concurrent node restart and continues query registration
> -----------------------------------------------------------------------------
>
>                 Key: IGNITE-9184
>                 URL: https://issues.apache.org/jira/browse/IGNITE-9184
>             Project: Ignite
>          Issue Type: Bug
>          Components: general
>    Affects Versions: 2.6
>            Reporter: Mikhail Cherkasov
>            Assignee: Dmitriy Govorukhin
>            Priority: Blocker
>             Fix For: 2.7
>
>         Attachments: StressTest.java, logs, stacktrace
>
>
> Please check the attached test case and stack trace.
> I can see: "Failed to wait for initial partition map exchange" message.
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message