Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DF85F6028 for ; Sat, 14 May 2011 14:25:13 +0000 (UTC) Received: (qmail 22830 invoked by uid 500); 14 May 2011 14:25:12 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 22806 invoked by uid 500); 14 May 2011 14:25:11 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 22798 invoked by uid 99); 14 May 2011 14:25:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 May 2011 14:25:11 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of watanabe.maki@gmail.com designates 209.85.214.44 as permitted sender) Received: from [209.85.214.44] (HELO mail-bw0-f44.google.com) (209.85.214.44) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 May 2011 14:25:07 +0000 Received: by bwz13 with SMTP id 13so3309378bwz.31 for ; Sat, 14 May 2011 07:24:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type:content-transfer-encoding; bh=zCs+6t3JWQrp/y/GTPN1WrJN+b5r62hZrgs1jeUdXp8=; b=XzQ0DZ+aruFaQvQlc5BKWck4TQYVvTcDfiJdOzOTspZbH868zYT9OlkrZ2kYMG+Qfd dJXNJpLILE+H0BzAyf32T3k85etpER0GvqQGlczEDNzFOv4V6vvYlp7sO4GyxA6IBTtM iSQ82srTZoNBCBG7DL/TeyvHaBzmJ7/cdScqU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=gvZjwys9YLvQV2Fa9eNjPPNpAp7eh/sJiSazQUfDq3TiPv1iroQBIDn0SYXLJYc57V XKoPmTeYgscOfOpw72H01ZPyY0jkxhTljVUB2Kkm89Wb6cx9GvQRaKeU3R94baVrXhKS nPAva3cbFw3gQVLXZl4EBFKyXveEuZhPA802M= MIME-Version: 1.0 Received: by 10.204.7.211 with SMTP id e19mr2366117bke.139.1305383085746; Sat, 14 May 2011 07:24:45 -0700 (PDT) Received: by 10.204.42.78 with HTTP; Sat, 14 May 2011 07:24:45 -0700 (PDT) In-Reply-To: References: <2F578912-1E00-4541-BA95-01815151888D@sgizmo.com> Date: Sat, 14 May 2011 07:24:45 -0700 Message-ID: Subject: Re: Inconsistent data issues when running nodetool move. From: Maki Watanabe To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable If you do CL=3DONE write + CL=3DALL read, then it seems OK... You should better to stay in this thread until some of experts would answer your question. 2011/5/14 Ryan Hadley : > Thanks Maki, > > That makes sense with my symptoms... =A0I was doing a CL=3DONE for write = and a CL=3DALL for read, expecting that to be sufficient. > > I will try both set to ALL and see if I get better consistency. > > -Ryan > > On May 14, 2011, at 4:41 AM, Maki Watanabe wrote: > >> It depends on what you really use which CL for your operations. >> Your RF is 2, so if you read/write with CL=3DALL, your r/w will be >> always consistent. If your read is CL=3DONE, you have chance to read old >> data anytime, decommission is not matter. CL=3DQUORUM on RF=3D2 is >> semantically identical with CL=3DALL. >> >> maki >> >> 2011/5/13 Ryan Hadley : >>> Hi, >>> >>> I'm running Cassandra (0.7.4) on a 4 node ring. =A0It was a 3 node ring= , but we ended up expanding it to 4... So then I followed the many suggesti= ons to rebalance the ring. =A0I found a script that suggested I use: >>> >>> # ~/nodes_calc.py >>> How many nodes are in your cluster? 4 >>> node 0: 0 >>> node 1: 42535295865117307932921825928971026432 >>> node 2: 85070591730234615865843651857942052864 >>> node 3: 127605887595351923798765477786913079296 >>> >>> So I started to migrate each node to those tokens. >>> >>> I have my replication factor set to 2, so I guess I was expecting to be= able to continue to use this ring without issues. =A0But it seems that the= node still accepts writes while it's decommissioning? =A0I say this becaus= e if I interrupt the decommission by stopping Cassandra and starting it aga= in, it appears to run through several commit logs. =A0And as soon as it's t= hrough with those commit logs, I no longer get consistency issues. >>> >>> The issue I'm seeing is that writes to this ring will succeed, but it's= possible for a subsequent read to return an older object. =A0For several m= inutes even. >>> >>> I'm not sure if I did something wrong... learning as I go here and this= list archive has been very useful. =A0But, is there anyway I can rebalance= the node and get better consistency? >>> >>> Thanks, >>> Ryan > > --=20 w3m