Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B8B3EDB7E for ; Tue, 26 Jun 2012 17:20:40 +0000 (UTC) Received: (qmail 78551 invoked by uid 500); 26 Jun 2012 17:20:38 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 78512 invoked by uid 500); 26 Jun 2012 17:20:38 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 78503 invoked by uid 99); 26 Jun 2012 17:20:38 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Jun 2012 17:20:38 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of karthik.nar@gmail.com designates 209.85.160.172 as permitted sender) Received: from [209.85.160.172] (HELO mail-gh0-f172.google.com) (209.85.160.172) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Jun 2012 17:20:30 +0000 Received: by ghbg16 with SMTP id g16so179400ghb.31 for ; Tue, 26 Jun 2012 10:20:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=UHuahtg9tCHsJzZYSyVHDIaG50KsJg2kLair3INO9g8=; b=aKkrUh1TLBPZ0a7kv2PKmRtPdBbQDnsTNN1t5B4+Dqf6rESKo4veO5U5OTZrjLVHnn GQ/31wfCVx8fuyGk+6dCla6nyQ95/8TEvLujmyORRW2fs0EKoEC9Bl9C1IpFxxuQG2PA NbuaTZjFSZxVoYNCaW4u972fkt1XDoGVhuHzN8tfuMNolVZhJ4Hw+d/ea0kmM/p3WVrO meRMpJbCtde+/CqnRpjaoijic7EALzrhTM6+IJhTjm2susYV9rkh+8jDmOCkWAFawRtT 23Ce3Cxa835FH+qWJ4ycQEh8SIgytq9ct82K1rb6XgbYLhjpk0kOmvG7iBxIyo48T9wv xu3A== MIME-Version: 1.0 Received: by 10.42.40.75 with SMTP id k11mr8757625ice.30.1340731209505; Tue, 26 Jun 2012 10:20:09 -0700 (PDT) Received: by 10.231.153.83 with HTTP; Tue, 26 Jun 2012 10:20:09 -0700 (PDT) In-Reply-To: References: Date: Tue, 26 Jun 2012 10:20:09 -0700 Message-ID: Subject: Re: Multi datacenter, WAN hiccups and replication From: Karthik N To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 I re-read my last post and didn't think I had done a good job articulating. Sorry! I'll try again... Say I choose LOCAL_QUORUM with a Replication Factor of 3. Cassandra stores three copies in my local datacenter. Therefore the cost associated with "losing" one node is not very high locally, and I usually disable HH, and use read repair/nodetool repair instead. However over the WAN, network blips are quite normal and HH really helps. More so because for WAN replication Cassandra sends only one copy to a coordinator in the remote datacenter, and it's rather vital for that copy to make it over to keep the two datacenters in sync. Therefore I was wondering if Cassandra already intelligently special cases for HH-over-WAN (since this is common) even if HH is disabled or alternately if there's a way to enable HH for WAN replication only while disabling it for the LOCAL_QUORUM? Thank you. Thanks, Karthik On Tue, Jun 26, 2012 at 10:14 AM, Karthik N wrote: > Let me attempt to articulate my question a little better. > > Say I choose LOCAL_QUORUM with a Replication Factor of 3. Cassandra > stores three copies in my local datacenter. Therefore the cost > associated with "losing" one node is not very high locally, and I > usually HH, and use read repair/nodetool repair instead. > > However over the WAN network blips are quite normal and HH really > helps. More so because for WAN replication Cassandra sends only one > copy to a coordinator in the remote datacenter. > > Therefore I was wondering if Cassandra already intelligently optimizes > for HH-over-WAN (since this is common) or alternately if there's a way > to enable HH for WAN replication? > > Thank you. > > On Tue, Jun 26, 2012 at 9:22 AM, Mohit Anchlia wrote: >> >> >> On Tue, Jun 26, 2012 at 8:16 AM, Karthik N wrote: >>> >>> Since Cassandra optimizes and sends only one copy over the WAN, can I opt >>> in only for HH for WAN replication and avoid HH for the local quorum? (since >>> I know I have more copies) >>> >>> >> >> I am not sure if I understand your question. In general I don't think you >> can selectively decide on HH. Besides HH should only be used when the outage >> is in mts, for longer outages using HH would only create memory pressure. >>> >>> On Tuesday, June 26, 2012, Mohit Anchlia wrote: >>>> >>>> >>>> On Tue, Jun 26, 2012 at 7:52 AM, Karthik N wrote: >>>>> >>>>> My Cassandra ring spans two DCs. I use local quorum with replication >>>>> factor=3. I do a write in DC1 with local quorum. Data gets written to >>>>> multiple nodes in DC1. For the same write to propagate to DC2 only one >>>>> copy is sent from the coordinator node in DC1 to a coordinator node in >>>>> DC2 for optimizing traffic over the WAN (from what I have read in the >>>>> Cassandra documentation) >>>>> >>>>> Will a Wan hiccup result in a Hinted Handoff (HH) being created in >>>>> DC1's coordinator for DC2 to be delivered when the Wan link is up >>>>> again? >>>> >>>> >>>> I have seen hinted handoff messages in the log files when the remote DC >>>> is unreachable. But this mechanism is only used for a the time defined in >>>> cassandra.yaml file. >>> >>> >>> >>> -- >>> Thanks, Karthik >> >>