Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 52160 invoked from network); 13 Apr 2010 16:24:08 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 13 Apr 2010 16:24:08 -0000 Received: (qmail 45271 invoked by uid 500); 13 Apr 2010 16:24:07 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 45244 invoked by uid 500); 13 Apr 2010 16:24:07 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 45226 invoked by uid 99); 13 Apr 2010 16:24:07 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Apr 2010 16:24:07 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=AWL,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of scottblanc@gmail.com designates 209.85.212.44 as permitted sender) Received: from [209.85.212.44] (HELO mail-vw0-f44.google.com) (209.85.212.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Apr 2010 16:24:01 +0000 Received: by vws11 with SMTP id 11so933385vws.31 for ; Tue, 13 Apr 2010 09:23:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:received:message-id :subject:from:to:content-type; bh=qwmhWlL7kAZaM9zST1xvD4LCzhaDVf+bzi1JQOEjgkw=; b=YbHlOiaag4FE9gePrDadJ9MwWmZOqJDF3K4FGbi+atTEZwnQEatBsF+G5a/lI5uiIR 0LYEXIhD0BvzPzemVEOXjddR94beRBjWVPbOjb85zfjNEC9WNYSmBk8IpqUEkQxU2rpB 7TyNVgRFd/WZsRRhfKA42KqWh5cMR956Erhnw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=F4ddSw8i+uNxjXwmV21PFkw1BpFTT0AGyd2vWp2TVKwXne75yInfZT0oX/+3Rtq+GV Ld66Wz2I1C54v9f/Iq18bVTSJ1nBeSfRQAs/I1gL08+2C14ihFGavLp2lwCHa+3Dvyhj 1mEjeC2WHgS9U4MH5IYpBDNEaD/+GHlEvcH+Y= MIME-Version: 1.0 Received: by 10.220.91.3 with HTTP; Tue, 13 Apr 2010 09:23:39 -0700 (PDT) Date: Tue, 13 Apr 2010 09:23:39 -0700 Received: by 10.220.107.148 with SMTP id b20mr3173034vcp.90.1271175819973; Tue, 13 Apr 2010 09:23:39 -0700 (PDT) Message-ID: Subject: Internal error From: Scott White To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=00c09f8a4d9e325675048420af7c --00c09f8a4d9e325675048420af7c Content-Type: text/plain; charset=ISO-8859-1 We started seeing for the first time errors that look like this last night after things were running smoothly for a couple days after switching to 0.6-rc1: ERROR [pool-1-thread-60] 2010-04-13 04:58:11,806 Cassandra.java (line 1492) Internal error processing insert We had not made any other changes to cassandra server or client config in days. Operationally we did try to bootstrap two new nodes, one of which failed after timing out. The other thing we did was a cleanup operation. Is it possible somehow any of those operations/events would somehow get things into a bad state? If not, any other ideas of what to look for as the root cause? Overall, the cluster seemed relatively healthy with the exception that one of our nodes which happens to be the seed node has been having excessively high network delays, hovering around 30-40 ms although it's been in that state for more than a week. Also a side question I was curious about, is the state of a boostrap or cleanup operation maintained after restart? thanks, Scott --00c09f8a4d9e325675048420af7c Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable We started seeing for the first time errors that look like this last night = after things were running smoothly for a couple days after switching to 0.6= -rc1:

ERROR [pool-1-thread-60] 2010-04-13 04:58:11,806 Cassandra.jav= a (line=20 1492) Internal error processing insert

We had not made any other cha= nges to cassandra server or client config in days. Operationally we did try= to bootstrap two new nodes, one of which failed after timing out. The othe= r thing we did was a cleanup operation.

Is it possible somehow any of those operations/events would somehow get= things into a bad state? If not, any other ideas of what to look for as th= e root cause?

Overall, the cluster seemed relatively healthy with th= e exception that one of our nodes which happens to be the seed node has bee= n having excessively high network delays, hovering around 30-40 ms although= it's been in that state for more than a week.

Also a side question I was curious about, is the state of a boostrap or= cleanup operation maintained after restart?

thanks,
Scott
--00c09f8a4d9e325675048420af7c--