Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 63846 invoked from network); 10 Dec 2010 17:31:38 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 10 Dec 2010 17:31:38 -0000 Received: (qmail 24030 invoked by uid 500); 10 Dec 2010 17:31:36 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 24011 invoked by uid 500); 10 Dec 2010 17:31:36 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 24002 invoked by uid 99); 10 Dec 2010 17:31:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Dec 2010 17:31:36 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [74.125.83.42] (HELO mail-gw0-f42.google.com) (74.125.83.42) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Dec 2010 17:31:27 +0000 Received: by gwb20 with SMTP id 20so2893904gwb.29 for ; Fri, 10 Dec 2010 09:31:06 -0800 (PST) MIME-Version: 1.0 Received: by 10.151.79.1 with SMTP id g1mr1759079ybl.375.1292002266262; Fri, 10 Dec 2010 09:31:06 -0800 (PST) Sender: scode@scode.org Received: by 10.150.220.9 with HTTP; Fri, 10 Dec 2010 09:31:06 -0800 (PST) X-Originating-IP: [95.193.249.15] In-Reply-To: <324C33D7-8A43-4159-9D23-EDE2BEB08238@toptarif.de> References: <46C412D5-2781-4682-9F9E-B8D5C1D60664@toptarif.de> <88F8E565-C3E6-4F07-BB6C-360582A9AB15@toptarif.de> <3E66DF50-DBF6-4D06-B0BF-67A2209445B8@toptarif.de> <324C33D7-8A43-4159-9D23-EDE2BEB08238@toptarif.de> Date: Fri, 10 Dec 2010 18:31:06 +0100 X-Google-Sender-Auth: n-GhlaAitGhySfsjD3B5QiJOzis Message-ID: Subject: Re: Quorum: killing 1 out of 3 server kills the cluster (?) From: Peter Schuller To: user@cassandra.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org > That's finally a precise statement! :) I was wondering what " to at least= 1 replica's commit log" is supposed to actually mean: http://wiki.apache.o= rg/cassandra/API The main idea is that it has been "officially delivered" to one replicate. If Cassandra only did batch-wise commit such that a write was never ACK:ed until it was durable, it would mean that it had been durably written to 1 replica set. I suspect the phrasing is to get around the fact that it is not actually durably written if nodes are configured to use periodic sync mode. > Does quorum mean that data is replicated to q nodes or to at least q node= s? That it is replicated to at least a quorom of nodes before the write is considered successful. This does not prevent further propagation to all nodes; data always gets replicated according to replication factor. Consistency levels only affect the consistency requirements of the particular request. > I just added another blank machine to my cluster. Nothing happened as ex= pected (stopped writing to the cluster) but after I ran nodetool repair it = held more data than all other nodes. So it copied data from the other nodes= to this one? I assumed that data is replicated to q nodes not to all, is q= uorum 'only' about consistency and not about saving storage space? The new node should have gotten its appropriate amount according to the ring responsibility (i.e., tokens). I'm not sure why a new node would get more than its fair share (according to tokens) of data though. There is one extreme case which would be if the cluster has seen lots of writes in degraded states so that there is a lot of data around the cluster that has not yet reached their full replica sets. A repair on a new node might make the new node be the only one that has all the data it should have... but you'd have to have written data at low consistency level during pretty shaky periods for this to have a significant effect (especially if hinted handoff is turned on). --=20 / Peter Schuller