Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 92127 invoked from network); 15 Jul 2010 01:44:40 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 15 Jul 2010 01:44:40 -0000 Received: (qmail 31455 invoked by uid 500); 15 Jul 2010 01:44:39 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 31408 invoked by uid 500); 15 Jul 2010 01:44:38 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 31400 invoked by uid 99); 15 Jul 2010 01:44:38 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Jul 2010 01:44:38 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbellis@gmail.com designates 74.125.83.172 as permitted sender) Received: from [74.125.83.172] (HELO mail-pv0-f172.google.com) (74.125.83.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Jul 2010 01:44:32 +0000 Received: by pvd12 with SMTP id 12so124409pvd.31 for ; Wed, 14 Jul 2010 18:44:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=0Qakyutg0y5FrbDi+i9oz0zcyBwwVmzkwNmsSrNJw3Y=; b=V4+rWZ1PPmawi744H/9PCoGfVu3P0GPdtkd+EyTr00Ic7ATsRRKEnlwptRdMVl7eYR UmCjYU/b9nHXIRTrAG0EL17+gUzVyw+npmpo+coVOUPfrbVhPPqOZB046ztr9CjfzTUr 1pBL4es1RqPRqcE6oUp1Qljauja8gUHQc8Dwo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=ugsQrdsr1D13raY8wvzviR6kjx+TssbvLXnKqRfGZkY54p5KyY7uU2xoS+4UAT10hR YkwGrvOt8ADtZAb0qMF+HH5BKjAxV5dpM2PhiIw+935yec1RZV7wHqNNzUrXdHDO53BR UyfUyyhcxELjUZKhiK3x8O14dCeL6174OQ14g= Received: by 10.142.211.5 with SMTP id j5mr611682wfg.261.1279158250540; Wed, 14 Jul 2010 18:44:10 -0700 (PDT) MIME-Version: 1.0 Received: by 10.142.103.9 with HTTP; Wed, 14 Jul 2010 18:43:50 -0700 (PDT) In-Reply-To: <1279142234.20148.47.camel@rnwk-dell> References: <1279132996.20148.44.camel@rnwk-dell> <1279142234.20148.47.camel@rnwk-dell> From: Jonathan Ellis Date: Wed, 14 Jul 2010 20:43:50 -0500 Message-ID: Subject: Re: node down window To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Coordination in a distributed system is difficult. I don't think we can fix HH's existing edge cases, without introducing other more complicated edge cases. So weekly-or-so repair will remain a common maintenance task for the forseeable future. On Wed, Jul 14, 2010 at 4:17 PM, B. Todd Burruss wrote: > thx, but disappointing :) > > is this just something we have to live with and periodically "repair" > the nodes? =A0or is there future work to tighten up the window? > > thx > > > On Wed, 2010-07-14 at 12:13 -0700, Jonathan Ellis wrote: >> On Wed, Jul 14, 2010 at 1:43 PM, B. Todd Burruss wro= te: >> > there is a window of time from when a node goes down and when the rest >> > of the cluster actually realizes that it is down. >> > >> > what happens to writes during this time frame? =A0does hinted handoff >> > record these writes and then "handoff" when the down node returns? =A0= or >> > does hinted handoff not kick in until the cluster realizes the node is >> > down? >> >> the latter. >> >> > ... is the only way these missed writes are repaired is through read >> > repair and/or manually kicking off "nodetool repair"? >> >> yes. >> > > > --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com