Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 98702200B86 for ; Sun, 4 Sep 2016 04:36:27 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 96F52160ACD; Sun, 4 Sep 2016 02:36:27 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id DA5CD160ABB for ; Sun, 4 Sep 2016 04:36:26 +0200 (CEST) Received: (qmail 4156 invoked by uid 500); 4 Sep 2016 02:36:21 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 4045 invoked by uid 99); 4 Sep 2016 02:36:21 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 04 Sep 2016 02:36:21 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id B13992C1B81 for ; Sun, 4 Sep 2016 02:36:20 +0000 (UTC) Date: Sun, 4 Sep 2016 02:36:20 +0000 (UTC) From: "Wei Deng (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (CASSANDRA-8190) Compactions stop completely because of RuntimeException in CompactionExecutor MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Sun, 04 Sep 2016 02:36:27 -0000 [ https://issues.apache.org/jira/browse/CASSANDRA-8190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei Deng updated CASSANDRA-8190: -------------------------------- Labels: lcs (was: ) > Compactions stop completely because of RuntimeException in CompactionExecutor > ----------------------------------------------------------------------------- > > Key: CASSANDRA-8190 > URL: https://issues.apache.org/jira/browse/CASSANDRA-8190 > Project: Cassandra > Issue Type: Bug > Environment: DSE 4.5.2 (Cassandra 2.0.10) > Reporter: Nikolai Grigoriev > Assignee: Marcus Eriksson > Labels: lcs > Attachments: cassandra-env.sh, cassandra.yaml, jstack.txt.gz, system.log.gz, system.log.gz > > > I have a cluster that is recovering from being overloaded with writes. I am using the workaround from CASSANDRA-6621 to prevent the STCS fallback (which is killing the cluster - see CASSANDRA-7949). > I have observed that after one or more exceptions like this > {code} > ERROR [CompactionExecutor:4087] 2014-10-26 22:50:05,016 CassandraDaemon.java (line 199) Exception in thread Thread[CompactionExecutor:4087,1,main] > java.lang.RuntimeException: Last written key DecoratedKey(425124616570337476, 0010000000001111000000000000033523da00001000000000033523da000000001111000000001000000000 > 00004000000000000000000100) >= current key DecoratedKey(-8778432288598355336, 0010000000001111000000000000040c7a8f00001000000000040c7a8f000000001111000000001000000000 > 00004000000000000000000100) writing into /cassandra-data/disk2/myks/mytable/myks-mytable-tmp-jb-130379-Data.db > at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:142) > at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:165) > at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160) > at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48) > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) > at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60) > at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59) > at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:198) > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) > at java.util.concurrent.FutureTask.run(FutureTask.java:262) > at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > {code} > the node completely stops the compactions and I end up in the state like this: > {code} > # nodetool compactionstats > pending tasks: 1288 > compaction type keyspace table completed total unit progress > Active compaction remaining time : n/a > {code} > The node recovers if restarted and starts compactions - until getting more exceptions like this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)