cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Fong <>
Subject Effectiveness of Scrub Operation vs SSTable previously marked in blacklist
Date Tue, 22 Mar 2016 03:36:07 GMT
Hi, all,

We recently encountered a scenario under Cassandra 2.0 deployment. Cassandra detected a corrupted
sstable, and when we attempt to scrub the sstable (with all the associated sstables), the
corrupted sstable was not included in the sstable list. This continues until we restart Cassandra
and perform sstable again.

After we traced the Cassandra source code, we are a bit confused with the effectiveness of
scrubbing and SStable being marked in blacklist in Cassandra 2.0+

It seems from previous version (Cassandra 1.2), the scrub operation would operate on a sstable
regardless of it being previously marked. However, in Cassandra 2.0, the function flows seems

Here is function flow that we traced in Cassandra 2.0 source code:

>From org.apache.cassandra.db.compaction.CompactionManager

    public void performScrub(ColumnFamilyStore cfStore, final boolean skipCorrupted, final
boolean checkData) throws InterruptedException, ExecutionException


        performAllSSTableOperation(cfStore, new AllSSTablesOperation()

    private void performAllSSTableOperation(final ColumnFamilyStore cfs, final AllSSTablesOperation
operation) throws InterruptedException, ExecutionException
        final Iterable<SSTableReader> sstables = cfs.markAllCompacting();

org.apache.cassandra.db. ColumnFamilyStore

    public Iterable<SSTableReader> markAllCompacting()


        Callable<Iterable<SSTableReader>> callable = new Callable<Iterable<SSTableReader>>()


            public Iterable<SSTableReader> call() throws Exception


                assert data.getCompacting().isEmpty() : data.getCompacting();

                Iterable<SSTableReader> sstables = Lists.newArrayList(AbstractCompactionStrategy.filterSuspectSSTables(getSSTables()));

                if (Iterables.isEmpty(sstables))

                    return null;

If it is true, would this flow - marking corrupted sstable in blacklist, defeat the original
purpose of scrub operation?  Thanks in advanced!


Michael Fong

View raw message