From: Edward Capriolo
To: user@cassandra.apache.org
Date: Thu, 15 Nov 2012 19:29:58 -0500
Subject: Re: Question regarding the need to run nodetool repair

On Thursday, November 15, 2012, Dwight Smith <Dwight.Smith@genesyslab.com> wrote:
> I have a 4 node cluster, version 1.1.2, replication factor of 4, read/write consistency of 3, level compaction. Several questions.
>
> 1) Should nodetool repair be run regularly to assure it has completed before gc_grace? If it is not run, what are the exposures?

Yes. Without it, a tombstone can be purged before every replica has seen it, and the deleted data can reappear.
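
To make the timing constraint concrete, here is a minimal sketch of a check that a repair schedule stays inside gc_grace. The names `repair_is_overdue` and `safety_margin` are hypothetical and are not part of Cassandra or nodetool; the 10 day value is Cassandra's default `gc_grace_seconds` (864000 seconds), so adjust it if your column families override it.

```python
from datetime import timedelta

# Cassandra's default gc_grace_seconds is 864000, i.e. 10 days.
DEFAULT_GC_GRACE = timedelta(seconds=864000)


def repair_is_overdue(since_last_repair: timedelta,
                      gc_grace: timedelta = DEFAULT_GC_GRACE,
                      safety_margin: timedelta = timedelta(days=1)) -> bool:
    """Hypothetical helper: True when the time since the last completed
    repair, plus a margin for the next repair to finish, reaches gc_grace."""
    return since_last_repair + safety_margin >= gc_grace


print(repair_is_overdue(timedelta(days=7)))  # False
print(repair_is_overdue(timedelta(days=9)))  # True
```

The margin is there because a repair has to finish, not just start, before gc_grace runs out; one day is an arbitrary placeholder.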
>
> 2) If a node goes down, and is brought back up prior to the 1 hour hinted handoff expiration, should repair be run immediately?

If the node is brought back up within the 1 hour window, you should let the hints replay. Repair is always safe to run.
>
> 3) If the hinted handoff has expired, the plan is to remove the node and start a fresh node in its place. Does this approach cause problems?
>
You only need to join a fresh node if the node was down longer than gc_grace. The default is 10 days.
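
Put together, the answers to 2) and 3) amount to a decision on how long the node was down. The sketch below is one reading of that advice, not an official procedure: `recovery_action` is a hypothetical name, the 1 hour hint window and 10 day gc_grace are the values mentioned in this thread, and the middle case (hints expired but still inside gc_grace, so restart and repair) is an inference from "repair is always safe to run".

```python
from datetime import timedelta

HINT_WINDOW = timedelta(hours=1)  # hinted handoff window from the question
GC_GRACE = timedelta(days=10)     # default gc_grace mentioned above


def recovery_action(downtime: timedelta,
                    hint_window: timedelta = HINT_WINDOW,
                    gc_grace: timedelta = GC_GRACE) -> str:
    """Hypothetical helper mapping node downtime to a recovery step."""
    if downtime < hint_window:
        # Hints are still held by the other replicas.
        return "restart the node and let hints replay"
    if downtime <= gc_grace:
        # Assumption: hints have expired, so repair fills the gap.
        return "restart the node and run repair"
    # Down longer than gc_grace: the node may hold data deleted elsewhere.
    return "replace with a fresh node"


print(recovery_action(timedelta(minutes=30)))
print(recovery_action(timedelta(days=2)))
print(recovery_action(timedelta(days=11)))
```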
>
> Thanks

If you read and write at quorum and run repair regularly, you can worry less about the things above because they are essentially non-factors.
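
For the cluster in the question, the quorum arithmetic works out as follows. This is a small sketch with hypothetical function names; it uses the usual quorum formula (replication factor divided by two, rounded down, plus one) and the overlap rule that a read is guaranteed to touch a replica holding the latest write when read replicas plus write replicas exceed the replication factor.

```python
def quorum(replication_factor: int) -> int:
    """Number of replicas that make up a quorum."""
    return replication_factor // 2 + 1


def read_overlaps_write(replication_factor: int,
                        read_replicas: int,
                        write_replicas: int) -> bool:
    """True when every read set must intersect every write set."""
    return read_replicas + write_replicas > replication_factor


rf = 4
print(quorum(rf))                     # 3
print(read_overlaps_write(rf, 3, 3))  # True
print(read_overlaps_write(rf, 1, 1))  # False
```

With a replication factor of 4, quorum is 3, so the read/write consistency of 3 from the original question is already quorum.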