Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ABC61DC7C for ; Wed, 27 Jun 2012 15:50:18 +0000 (UTC) Received: (qmail 25903 invoked by uid 500); 27 Jun 2012 15:50:16 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 25879 invoked by uid 500); 27 Jun 2012 15:50:16 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 25870 invoked by uid 99); 27 Jun 2012 15:50:16 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Jun 2012 15:50:16 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.220.172] (HELO mail-vc0-f172.google.com) (209.85.220.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Jun 2012 15:50:09 +0000 Received: by vcqp1 with SMTP id p1so879938vcq.31 for ; Wed, 27 Jun 2012 08:49:48 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type :x-gm-message-state; bh=7Uib12qxq8BR6kTmFKEiq7JnGRdsKvj4xlw/bbgq86g=; b=c6sO8mFUiqlkE9vpeR4/2jg6pvCWaYYcXpFnR7UXF8/h+jOf9R4nkI/qhdeqEJ9L02 blp4eZRw+slkpU+/e7agCQk+YFmTNwerD4DS2fIANHO4D/uY2POLlaxLbPbINjiSiIch KjgD8wyKTH+/lHmrjYtHNT+vCdP28U1GAQX6dRs31GEFPGW4ZNrxXrGiVU3hyAG3g0Yd 8lYvzqEPkZ3bpfNfPGXnPLCvzPon7SH3ENyqs+0hUsz6n3Ef7cejN3ChFMVw1l5WlCm8 rxyGi8cU4niBU1+exTIIlATW97qdn5q5x7f+eQPkOaMdkCMBdIarr8mE+JXgU+uiFkyS Cwzg== MIME-Version: 1.0 Received: by 10.52.25.70 with SMTP id a6mr12033057vdg.78.1340812188674; Wed, 27 Jun 2012 08:49:48 -0700 (PDT) Received: by 10.52.74.162 with HTTP; Wed, 27 Jun 2012 08:49:48 -0700 (PDT) Date: Wed, 27 Jun 2012 17:49:48 +0200 Message-ID: Subject: Node crashing during read repair From: Robin Verlangen To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=20cf3079bdda376a3504c3762a27 X-Gm-Message-State: ALoCoQk9xTny9GkQgH7TlPSx38R5b3/Z363ZKS1rmB7fblVkesn4VzT2OkXVY46qsjFkbVPWGd0o --20cf3079bdda376a3504c3762a27 Content-Type: text/plain; charset=ISO-8859-1 Hi there, Today I found one node (running 1.1.1 in a 3 node cluster) being dead for the third time this week, it died with the following message: ERROR [ReadRepairStage:3] 2012-06-27 14:28:30,929 AbstractCassandraDaemon.java (line 134) Exception in thread Thread[ReadRepairStage:3,5,main] java.util.concurrent.RejectedExecutionException: ThreadPoolExecutor has shut down at org.apache.cassandra.concurrent.DebuggableThreadPoolExecutor$1.rejectedExecution(DebuggableThreadPoolExecutor.java:60) at java.util.concurrent.ThreadPoolExecutor.reject(Unknown Source) at java.util.concurrent.ThreadPoolExecutor.execute(Unknown Source) at org.apache.cassandra.net.MessagingService.receive(MessagingService.java:566) at org.apache.cassandra.net.MessagingService.sendOneWay(MessagingService.java:439) at org.apache.cassandra.net.MessagingService.sendRR(MessagingService.java:391) at org.apache.cassandra.net.MessagingService.sendRR(MessagingService.java:372) at org.apache.cassandra.net.MessagingService.sendRR(MessagingService.java:460) at org.apache.cassandra.service.RowRepairResolver.scheduleRepairs(RowRepairResolver.java:136) at org.apache.cassandra.service.RowRepairResolver.resolve(RowRepairResolver.java:94) at org.apache.cassandra.service.AsyncRepairCallback$1.runMayThrow(AsyncRepairCallback.java:54) at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source) at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) at java.lang.Thread.run(Unknown Source) Is this a common bug in 1.1.1, or did I get a "race" condition? -- With kind regards, Robin Verlangen *Software engineer* * * W http://www.robinverlangen.nl E robin@us2.nl Disclaimer: The information contained in this message and attachments is intended solely for the attention and use of the named addressee and may be confidential. If you are not the intended recipient, you are reminded that the information remains the property of the sender. You must not use, disclose, distribute, copy, print or rely on this e-mail. If you have received this message in error, please contact the sender immediately and irrevocably delete this message and any copies. --20cf3079bdda376a3504c3762a27 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi there,

Today I found one node (running 1.1.1 in a 3 n= ode cluster) being dead for the third time this week, it died with the foll= owing message:

ERROR [ReadRepairStage:3] 2012= -06-27 14:28:30,929 AbstractCassandraDaemon.java (line 134) Exception in th= read Thread[ReadRepairStage:3,5,main]
java.util.concurrent.RejectedExecutionException: ThreadPoolExecutor ha= s shut down
=A0 =A0 =A0 =A0 at org.apache.cassandra.concurrent.De= buggableThreadPoolExecutor$1.rejectedExecution(DebuggableThreadPoolExecutor= .java:60)
=A0 =A0 =A0 =A0 at java.util.concurrent.ThreadPoolExecutor.reject(Unkn= own Source)
=A0 =A0 =A0 =A0 at java.util.concurrent.ThreadPoolExe= cutor.execute(Unknown Source)
=A0 =A0 =A0 =A0 at org.apache.cassa= ndra.net.MessagingService.receive(MessagingService.java:566)
=A0 =A0 =A0 =A0 at org.apache.cassandra.net.MessagingService.sendOneWa= y(MessagingService.java:439)
=A0 =A0 =A0 =A0 at org.apache.cassan= dra.net.MessagingService.sendRR(MessagingService.java:391)
=A0 = =A0 =A0 =A0 at org.apache.cassandra.net.MessagingService.sendRR(MessagingSe= rvice.java:372)
=A0 =A0 =A0 =A0 at org.apache.cassandra.net.MessagingService.sendRR(Me= ssagingService.java:460)
=A0 =A0 =A0 =A0 at org.apache.cassandra.= service.RowRepairResolver.scheduleRepairs(RowRepairResolver.java:136)
=
=A0 =A0 =A0 =A0 at org.apache.cassandra.service.RowRepairResolver.reso= lve(RowRepairResolver.java:94)
=A0 =A0 =A0 =A0 at org.apache.cassandra.service.AsyncRepairCallback$1.= runMayThrow(AsyncRepairCallback.java:54)
=A0 =A0 =A0 =A0 at org.a= pache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
=A0 =A0 =A0 =A0 at java.util.concurrent.ThreadPoolExecutor$Worker.runTask= (Unknown Source)
=A0 =A0 =A0 =A0 at java.util.concurrent.ThreadPoolExecutor$Worker.run(= Unknown Source)
=A0 =A0 =A0 =A0 at java.lang.Thread.run(Unknown S= ource)

Is this a common bug in 1.1.1, or did I get= a "race" condition?

--
With kind regards,

Robin Verlangen=
Software engineer

E robin@us2.nl

Disclaimer: The information= contained in this message and attachments is intended solely for the atten= tion and use of the named addressee and may be confidential. If you are not= the intended recipient, you are reminded that the information remains the = property of the sender. You must not use, disclose, distribute, copy, print= or rely on this e-mail. If you have received this message in error, please= contact the sender immediately and irrevocably delete this message and any= copies.

--20cf3079bdda376a3504c3762a27--