cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stefania (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-9686) FSReadError and LEAK DETECTED after upgrading
Date Wed, 08 Jul 2015 08:26:04 GMT


Stefania commented on CASSANDRA-9686:

This is ready for review.

I added handling of {{CorruptSSTableException}} and {{FSError}} when loading sstables. When
the disk failure policy is {{die}}, the process will exit as expected. For {{CorruptSSTableException}},
the transports will be stopped only with {{stop_paranoid}}, not just {{stop}}, as per documentation
in the yaml file. 

One potential issue is that tables are loaded during the initial health checks, before the
transports are started. So stopping the transports has no effect. Later on they are started
as if nothing happened. So effectively only the {{die}} policy is honored in this scenario.
Not sure what to do about this as it could be legit to want to restart the transports later
on via JMX.

CI results:

One flacky dtest, I've scheduled a new build (#4).

> FSReadError and LEAK DETECTED after upgrading
> ---------------------------------------------
>                 Key: CASSANDRA-9686
>                 URL:
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>         Environment: Windows-7-32 bit, 3.2GB RAM, Java 1.7.0_55
>            Reporter: Andreas Schnitzerling
>            Assignee: Stefania
>             Fix For: 2.2.x
>         Attachments: cassandra.bat, cassandra.yaml,,,
> After upgrading one of 15 nodes from 2.1.7 to 2.2.0-rc1 I get FSReadError and LEAK DETECTED
on start. Deleting the listed files, the failure goes away.
> {code:title=system.log}
> ERROR [SSTableBatchOpen:1] 2015-06-29 14:38:34,554
- Error in ThreadPoolExecutor
> Compressed file with 0 chunks
> 	at
> 	at<init>(
> 	at
> 	at$Builder.metadata(
> 	at$Builder.complete(
> 	at$Builder.complete(
> 	at
> 	at
> 	at
> 	at
> 	at$
> 	at java.util.concurrent.Executors$ Source) ~[na:1.7.0_55]
> 	at Source) ~[na:1.7.0_55]
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) [na:1.7.0_55]
> 	at java.util.concurrent.ThreadPoolExecutor$ Source) [na:1.7.0_55]
> 	at Source) [na:1.7.0_55]
> Caused by: Compressed file with 0 chunks encountered:
> 	at
> 	... 15 common frames omitted
> ERROR [Reference-Reaper:1] 2015-06-29 14:38:34,734 - LEAK DETECTED: a reference
(org.apache.cassandra.utils.concurrent.Ref$State@3e547f) to class$InstanceTidier@1926439:D:\Programme\Cassandra\data\data\system\compactions_in_progress\system-compactions_in_progress-ka-6866
was not released before the reference was garbage collected
> {code}

This message was sent by Atlassian JIRA

View raw message