Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D3A989CE0 for ; Tue, 17 Apr 2012 15:37:13 +0000 (UTC) Received: (qmail 52678 invoked by uid 500); 17 Apr 2012 15:37:11 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 52653 invoked by uid 500); 17 Apr 2012 15:37:11 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 52645 invoked by uid 99); 17 Apr 2012 15:37:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Apr 2012 15:37:11 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of saasira@gmail.com designates 209.85.213.172 as permitted sender) Received: from [209.85.213.172] (HELO mail-yx0-f172.google.com) (209.85.213.172) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Apr 2012 15:37:05 +0000 Received: by yenm5 with SMTP id m5so3579054yen.31 for ; Tue, 17 Apr 2012 08:36:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to:content-type; bh=w7uE0zvI1TyzA9p9TTJ8LcYIpG0zR7YGaUZZ1Jb/HlI=; b=fmXwnuFBHXkYWZ1kTbWrI9P9I13zKPsVeCvz5kQdnl6sCQ32RL4Mx3n3HuikpRseXR 66K5s2NgaOjoFDF1q6M4e0RUXhpTNzzbvHAuCsN3F6TBmzqpwH1gGlbiUikdACef6t/H aQhLubouhV1nwYMbYhOYFyKDFTZ+cxLsz5MS3xBlv+MKYh+nAkETlHcVEjzRW/XGiDYy gjG02MZj47Lm7ue/pCNkRE7hYvhvXnResfKGWEom0zNCJFuyPgDKviSoN0Ho+pA+22DC gILsOkxaZ+bpqBtw/8UVQ8KGRJlFzJtMgcjNrRtl+f08ny8MXCT0Yi+8Q2D3tFsLiOWX DFxQ== Received: by 10.60.27.170 with SMTP id u10mr22824926oeg.50.1334677004180; Tue, 17 Apr 2012 08:36:44 -0700 (PDT) MIME-Version: 1.0 Received: by 10.60.120.8 with HTTP; Tue, 17 Apr 2012 08:36:23 -0700 (PDT) From: Samba Date: Tue, 17 Apr 2012 21:06:23 +0530 Message-ID: Subject: Multi Master replication : rejoining a node after split network To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=e89a8f643316b95f2604bde1b425 X-Virus-Checked: Checked by ClamAV on apache.org --e89a8f643316b95f2604bde1b425 Content-Type: text/plain; charset=ISO-8859-1 Hi all, We are evaluating Cassandra for a geographically distributed deployment that requires multi master replication. We have a few questions regarding how replication is handled in Cassandra, like: 1. Which mechanism is used to replicate the changes from one system to another: statement distribution or recording the changeset via triggers or storing the changeset in transaction log? 2. Since replication is continuous copying of changes from one node to another, these changes would have to be snapshotted in order to sustain temporary network failures so that replication can resume after the network problem is healed. is there a mechanism to define how long we can store/archive the snaphotted changes before we discard and would demand a recreation of node from the scratch rather than rejoin 3. What options are available for conflict resolution since we are talking about master-master replication across tens of nodes? 4. If a node is rejoined after a split network where same records would have been modified on multiple nodes, is there a mechanism to merge the data, resolve conflicts and eventually reach to a consistent state? Thanks and Regards, Samba --e89a8f643316b95f2604bde1b425 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi all,
= We are evaluating Cassandra for a geographically distributed deployment tha= t requires multi master replication.

We have a few questions regardi= ng how replication is handled in Cassandra, like:

  1. Which mechanism is used to replicate the changes from one syste= m to another: statement distribution or recording the changeset via trigger= s or storing the changeset in transaction log?
  2. Since replication is= continuous copying of changes from one node to another, these changes woul= d have to be snapshotted in order to sustain temporary network failures so = that replication can resume after the network problem is healed. is there a= mechanism to define how long we can store/archive the snaphotted changes b= efore we discard and would demand a recreation of node from the scratch rat= her than rejoin
  3. What options are available for conflict resolution since we are talking= about master-master replication across tens of nodes?
  4. If a node is= rejoined after a split network where same records would have been modified= on multiple nodes, is there a mechanism to merge the data, resolve conflic= ts and eventually reach to a consistent state?
Thanks and Regards,
Samba --e89a8f643316b95f2604bde1b425--