ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alexey Goncharuk (Jira)" <j...@apache.org>
Subject [jira] [Comment Edited] (IGNITE-12128) Potentially pds corruption on a failed node during checkpoint
Date Mon, 09 Sep 2019 13:42:00 GMT

    [ https://issues.apache.org/jira/browse/IGNITE-12128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16925701#comment-16925701
] 

Alexey Goncharuk edited comment on IGNITE-12128 at 9/9/19 1:41 PM:
-------------------------------------------------------------------

[~akalashnikov] thanks, merged your changes to master and ignite-2.7.6!


was (Author: agoncharuk):
[~akalashnikov] thanks, merged your changes to master!

> Potentially pds corruption on a failed node during checkpoint
> -------------------------------------------------------------
>
>                 Key: IGNITE-12128
>                 URL: https://issues.apache.org/jira/browse/IGNITE-12128
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Dmitriy Govorukhin
>            Assignee: Anton Kalashnikov
>            Priority: Critical
>             Fix For: 2.7.6
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> There are the case when we start a checkpoint but not create CP file marker, but PageMemory
may start to flush dirty pages from checkpoint pages to page store.  If node crashed at this
moment, we can get inconsistency state, because we still not write checkpoint marker to disk
but already write some pages for this checkpoint. If we try to recover from this state we
cat get any sort of corruption problem. Recovery logic may not recognize that crash was during
checkpoint because we did not write file marker when we start checkpoint but write some pages
for this checkpoint.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

Mime
View raw message