ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alexey Goncharuk (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (IGNITE-8053) Exception during checkpoint concurrent changes in topology
Date Thu, 29 Mar 2018 11:57:01 GMT

     [ https://issues.apache.org/jira/browse/IGNITE-8053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Alexey Goncharuk updated IGNITE-8053:
-------------------------------------
    Fix Version/s: 2.5

> Exception during checkpoint concurrent changes in topology
> ----------------------------------------------------------
>
>                 Key: IGNITE-8053
>                 URL: https://issues.apache.org/jira/browse/IGNITE-8053
>             Project: Ignite
>          Issue Type: Bug
>          Components: persistence
>    Affects Versions: 2.4
>            Reporter: Sergey Kosarev
>            Assignee: Alexey Goncharuk
>            Priority: Major
>             Fix For: 2.5
>
>
> Excpetion in checkpoint thread 
> {code}
> 2018-03-23 15:25:19.085 [ERROR][db-checkpoint-thread-#256%DPL_GRID%DplGridNodeName%][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager]
Runtime error caught during grid runnable execution: GridWorker [name=
> db-checkpoint-thread, igniteInstanceName=DPL_GRID%DplGridNodeName, finished=false, hashCode=981946370,
interrupted=false, runner=db-checkpoint-thread-#256%DPL_GRID%DplGridNodeName%]
> java.lang.IllegalStateException: Failed to add new partition to the partitions state
(no enough space reserved) [partId=32321, reserved=5416]
>   at org.apache.ignite.internal.pagemem.wal.record.CacheState.addPartitionState(CacheState.java:64)
>   at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$Checkpointer.markCheckpointBegin(GridCacheDatabaseSharedManager.java:2955)
>   at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$Checkpointer.doCheckpoint(GridCacheDatabaseSharedManager.java:2704)
>   at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$Checkpointer.body(GridCacheDatabaseSharedManager.java:2629)
>   at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110)
>   at java.lang.Thread.run(Thread.java:745)
> {code}
> threre are 2 sequential invocations grp.topology().currentLocalPartitions() in
> GridCacheDatabaseSharedManager.Checkpointer#markCheckpointBegin
> it's assumed that results must be equal, but they doesn't actually.
> Possible problem is in
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtPartitionTopologyImpl#forceCreatePartition
> it does not check for checkpoint lock
> ctx.database().checkpointLockIsHeldByThread();



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message