cassandra-commits mailing list archives

From "Yuki Morishita (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-10519) RepairException: [repair #... on .../..., (...,...]] Validation failed in /w.x.y.z
Date Thu, 15 Oct 2015 17:14:05 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-10519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14959239#comment-14959239 ]

Yuki Morishita commented on CASSANDRA-10519:
--------------------------------------------

{code}
Cannot start multiple repair sessions over the same sstables
{code}

There was a leftover incremental repair session on one of the nodes.
Restarting that node will solve the problem.

Recent versions of C* try to clear out leftover sessions, so this should be less likely to happen.
(It is not perfect, though; we need something like CASSANDRA-10302 to keep the state clean.)
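
To illustrate why a leftover session blocks later repairs, here is a minimal toy model in Java. It is not Cassandra's actual code: the class RepairSessionTracker and its methods are hypothetical names, and it assumes that session state lives only in memory, which is what the restart advice suggests.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.UUID;

/** Toy model of per-session sstable tracking; NOT Cassandra's implementation. */
public class RepairSessionTracker {
    // Hypothetical in-memory registry: repair session id -> sstables it has claimed.
    private final Map<UUID, Set<String>> claimed = new HashMap<>();

    /** Claims sstables for a session; fails if another session still holds any of them. */
    public synchronized void startValidation(UUID session, Set<String> sstables) {
        for (Map.Entry<UUID, Set<String>> entry : claimed.entrySet()) {
            if (entry.getKey().equals(session)) {
                continue; // a session may revisit its own sstables
            }
            for (String sstable : sstables) {
                if (entry.getValue().contains(sstable)) {
                    throw new IllegalStateException(
                        "Cannot start multiple repair sessions over the same sstables");
                }
            }
        }
        claimed.computeIfAbsent(session, k -> new HashSet<>()).addAll(sstables);
    }

    /** Releases a session's claim. A session that dies without calling this is "leftover". */
    public synchronized void finish(UUID session) {
        claimed.remove(session);
    }

    /** Models a node restart: the in-memory registry starts out empty again. */
    public synchronized void restart() {
        claimed.clear();
    }

    public static void main(String[] args) {
        RepairSessionTracker node = new RepairSessionTracker();
        Set<String> sstables = Collections.singleton("la-1195-big");

        UUID leftover = UUID.randomUUID();
        node.startValidation(leftover, sstables); // never finished, so it lingers

        UUID next = UUID.randomUUID();
        try {
            node.startValidation(next, sstables);
        } catch (IllegalStateException e) {
            System.out.println("Before restart: " + e.getMessage());
        }

        node.restart();
        node.startValidation(next, sstables);
        System.out.println("After restart: validation started");
    }
}
```

In this sketch the second session fails only because the first one never released its claim; clearing the registry (the restart) or calling finish for the stale session lets it proceed.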

> RepairException: [repair #... on .../..., (...,...]] Validation failed in /w.x.y.z
> ----------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-10519
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-10519
>             Project: Cassandra
>          Issue Type: Bug
>         Environment: CentOS 7, JDK 8u60, Cassandra 2.2.2 (upgraded from 2.1.5)
>            Reporter: Gábor Auth
>
> Sometimes the repair fails:
> {code}
> ERROR [Repair#3:1] 2015-10-14 06:22:56,490 CassandraDaemon.java:185 - Exception in thread Thread[Repair#3:1,5,RMI Runtime]
> com.google.common.util.concurrent.UncheckedExecutionException: org.apache.cassandra.exceptions.RepairException: [repair #018adc70-723c-11e5-b0d8-6b2151e4d388 on keyspace/table, (2414492737393085601,2788053941340954029]] Validation failed in /w.y.x.z
>         at com.google.common.util.concurrent.Futures.wrapAndThrowUnchecked(Futures.java:1387) ~[guava-16.0.jar:na]
>         at com.google.common.util.concurrent.Futures.getUnchecked(Futures.java:1373) ~[guava-16.0.jar:na]
>         at org.apache.cassandra.repair.RepairJob.run(RepairJob.java:169) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_60]
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[na:1.8.0_60]
>         at java.lang.Thread.run(Thread.java:745) ~[na:1.8.0_60]
> Caused by: org.apache.cassandra.exceptions.RepairException: [repair #018adc70-723c-11e5-b0d8-6b2151e4d388 on keyspace/table, (2414492737393085601,2788053941340954029]] Validation failed in /w.y.x.z
>         at org.apache.cassandra.repair.ValidationTask.treeReceived(ValidationTask.java:64) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at org.apache.cassandra.repair.RepairSession.validationComplete(RepairSession.java:183) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at org.apache.cassandra.service.ActiveRepairService.handleMessage(ActiveRepairService.java:399) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVerbHandler.java:163) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:66) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         ... 3 common frames omitted
> {code}
> And here is the w.y.x.z side:
> {code}
> ERROR [ValidationExecutor:7] 2015-10-14 06:22:56,487 CompactionManager.java:1053 - Cannot start multiple repair sessions over the same sstables
> ERROR [ValidationExecutor:7] 2015-10-14 06:22:56,487 Validator.java:246 - Failed creating a merkle tree for [repair #018adc70-723c-11e5-b0d8-6b2151e4d388 on keyspace/table, (2414492737393085601,2788053941340954029]], /a.b.c.d (see log for details)
> ERROR [ValidationExecutor:7] 2015-10-14 06:22:56,488 CassandraDaemon.java:185 - Exception in thread Thread[ValidationExecutor:7,1,main]
> java.lang.RuntimeException: Cannot start multiple repair sessions over the same sstables
>         at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:1054) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at org.apache.cassandra.db.compaction.CompactionManager.access$700(CompactionManager.java:86) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at org.apache.cassandra.db.compaction.CompactionManager$10.call(CompactionManager.java:652) ~[apache-cassandra-2.2.2.jar:2.2.2]
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_60]
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_60]
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_60]
>         at java.lang.Thread.run(Thread.java:745) [na:1.8.0_60]
> ...
> ERROR [Reference-Reaper:1] 2015-10-14 06:23:21,439 Ref.java:187 - LEAK DETECTED: a reference (org.apache.cassandra.utils.concurrent.Ref$State@74fc054a) to class org.apache.cassandra.io.sstable.format.SSTableReader$InstanceTidier@1949471967:/home/cassandra/dsc-cassandra-2.2.2/bin/../data/data/keyspace/table-b15521b0621111e4bbedcdee5e027297/la-1195-big was not released before the reference was garbage collected
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
