Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 70635 invoked from network); 24 Jun 2010 21:58:51 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 24 Jun 2010 21:58:51 -0000 Received: (qmail 39940 invoked by uid 500); 24 Jun 2010 21:58:50 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 39898 invoked by uid 500); 24 Jun 2010 21:58:50 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 39890 invoked by uid 99); 24 Jun 2010 21:58:49 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Jun 2010 21:58:49 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbellis@gmail.com designates 209.85.160.44 as permitted sender) Received: from [209.85.160.44] (HELO mail-pw0-f44.google.com) (209.85.160.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Jun 2010 21:58:43 +0000 Received: by pwi6 with SMTP id 6so3084736pwi.31 for ; Thu, 24 Jun 2010 14:58:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=ks9X3/HT+gKzQTE+/EAJpDKck2slFYa87oKZtS4GjKQ=; b=p8H3TERdzrGXPu5wPPcZrxUM1UJlcmcvqNKCuZfQaVn2JQL94Z9XUoDrEXoYwOBfCe vhl8rZQ1qxrNx/2s6KWOAShJbgSrHeZPVojv0DTLG/nJ4GK/YB1aAoPrBigI/kDovMan 4Y/vJh5gpVZZ2bnfalbun+/vloKsHXKsAfUjg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=VH7HrkLc+M+xcSDwpTZ+TqGG/WicNfmRJGO4fr/WYkHDV/8zPMqVrcKd7i6gzsoh/j pLtcx35EDQsSgd8h87I9FMuBAX0OhWvMkSZjNHrhYFOK4iQlXKQcBG+cTLxQFo3kaPg6 +V/uAlFRWw5LJ9QrkYUQu/7CkzMEKcikM5wjw= Received: by 10.142.249.4 with SMTP id w4mr10038919wfh.171.1277416702164; Thu, 24 Jun 2010 14:58:22 -0700 (PDT) MIME-Version: 1.0 Received: by 10.143.28.8 with HTTP; Thu, 24 Jun 2010 14:58:02 -0700 (PDT) In-Reply-To: References: From: Jonathan Ellis Date: Thu, 24 Jun 2010 17:58:02 -0400 Message-ID: Subject: Re: Timeout when cluster node fails/restarts To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org getting a TimedOutException for a few requests when a machine fails before Cassandra's Failure Detector notices is normal. On Wed, Jun 23, 2010 at 12:34 PM, Wouter de Bie wrote: > Hi, > > I've currently setup a cluster of 11 nodes. When running a small applicat= ion that uses Hector to read and write keys, and restarting one of the node= s (not the one the application is connected to), the application stalls, ti= mes out and reconnects. This takes roughly 10 seconds. When the node is mar= ked as dead, the application seems to continue again. The application itsel= f is only connecting to localhost on one of the nodes. > Maybe interesting to mention is the fact that all nodes in the cluster ar= e configured as seeds and have all other nodes configured as seeds as well.= I'm not sure if this is causing the problem and if it's even related. > > I'm using cassandra 0.6.2 and Hector 0.6.0-15 (latest github branch) > > What am I doing wrong here? > > Regards, > > Wouter --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com