Date: Sat, 9 Apr 2011 12:11:16 -0700
From: Vram Kouramajian
To: user@cassandra.apache.org
Subject: Re: Site Not Surviving a Single Cassandra Node Crash
In-Reply-To: <301C715B-CD4A-4C09-9E4A-A2F04635D937@joestump.net>

The Hector clients are used as part of our Jetty servers, and the Jetty servers stop responding when one of the Cassandra nodes goes down.

Vram

On Sat, Apr 9, 2011 at 11:54 AM, Joe Stump wrote:
> Did the Cassandra cluster go down, or did you start getting failures from the client when it routed queries to the downed node? The key in the client is to keep working around the ring if the initial node is down.
>
> --Joe
>
> On Apr 9, 2011, at 12:52 PM, Vram Kouramajian wrote:
>
>> We have 5 Cassandra nodes with the following configuration:
>>
>> Cassandra Version: 0.6.11
>> Number of Nodes: 5
>> Replication Factor: 3
>> Client: Hector 0.6.0-14
>> Write Consistency Level: Quorum
>> Read Consistency Level: Quorum
>> Ring Topology:
>>
>> Address         Status  Load     Owns    Range                                    Ring
>>                                          132756707369141912386052673276321963528
>> 192.168.89.153  Up      4.15 GB  33.87%  20237398133070283622632741498697119875   |<--|
>> 192.168.89.155  Up      5.17 GB  18.29%  51358066040236348437506517944084891398   |   ^
>> 192.168.89.154  Up      7.41 GB  33.97%  109158969152851862753910401160326064203  v   |
>> 192.168.89.152  Up      5.07 GB  6.34%   119944993359936402983569623214763193674  |   ^
>> 192.168.89.151  Up      4.22 GB  7.53%   132756707369141912386052673276321963528  |-->|
>>
>> We believe that our setup should survive the crash of one of the
>> Cassandra nodes. But we had a few crashes, and the system stopped
>> functioning until we brought back the Cassandra nodes.
>>
>> Any clues?
>>
>> Vram
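
The expectation in the original post (replication factor 3 with quorum reads and writes should survive one node crash) can be checked with simple quorum arithmetic. The sketch below is illustrative only: the function names are made up for this note and are not part of Cassandra or Hector, and it assumes the usual majority definition of quorum, floor(RF / 2) + 1.

```python
def quorum(replication_factor: int) -> int:
    """Number of replicas that must respond: a majority, floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def tolerated_replica_failures(replication_factor: int) -> int:
    """Replicas of a key that can be down while a quorum is still reachable."""
    return replication_factor - quorum(replication_factor)
```

With a replication factor of 3, `quorum(3)` is 2 and `tolerated_replica_failures(3)` is 1. A single node crash removes at most one replica of any key, so quorum operations should still be satisfiable on the server side. That points toward the client's handling of the dead node rather than the cluster configuration.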
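
Joe's advice to "keep working around the ring if the initial node is down" can be sketched as a simple client-side failover loop. This is a minimal illustration, not Hector's actual API: `NodeDown`, `call_with_failover`, and the `send` callable are hypothetical names introduced here, and a real client would also need timeouts and some way to retire and later re-add dead hosts.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


class NodeDown(Exception):
    """Raised by the (hypothetical) send callable when a host is unreachable."""


def call_with_failover(hosts: Sequence[str], send: Callable[[str], T]) -> T:
    """Try each host in order and return the first successful result.

    `send` is an assumed callable that performs the request against one
    host and raises NodeDown if that host cannot be reached.
    """
    last_error = None
    for host in hosts:
        try:
            return send(host)
        except NodeDown as exc:
            # This host is down; move on to the next one in the ring.
            last_error = exc
    raise RuntimeError("no Cassandra host reachable") from last_error
```

If the client instead keeps retrying or blocking on the one dead host, every request thread that picks that host can hang, which would match the symptom of the Jetty servers no longer responding.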