Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 2350 invoked from network); 8 Mar 2011 17:54:01 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 8 Mar 2011 17:54:01 -0000 Received: (qmail 12590 invoked by uid 500); 8 Mar 2011 17:25:58 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 12569 invoked by uid 500); 8 Mar 2011 17:25:58 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 12561 invoked by uid 99); 8 Mar 2011 17:25:58 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Mar 2011 17:25:58 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lewilists@gmail.com designates 74.125.83.43 as permitted sender) Received: from [74.125.83.43] (HELO mail-gw0-f43.google.com) (74.125.83.43) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Mar 2011 17:25:50 +0000 Received: by gwb11 with SMTP id 11so3635012gwb.30 for ; Tue, 08 Mar 2011 09:25:30 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:content-type:mime-version:subject:from :in-reply-to:date:content-transfer-encoding:message-id:references:to :x-mailer; bh=mslPX0/aSHxVYjR2M0wPlM/KZm7i2t8tjzeOLYkJalk=; b=jdcOGiMU15Lq9Kkxb0ygaRb0KeM+BnXSVI+M3CnbI+gRwjQeIC013DXgzfTk+tVr65 sXTR1h5QG5j+81H5U+HrgdYWtA1cKU5qtyQ2/7n+wwrMq0m/sY+JtDzArVZa3w49jP/t +1gGHjzdrn3uYrU7dRDxjhckFSDt0ZTba8IpU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; b=k8uQGFgwtnVoFO73kSFE8IJlhpJ0mgCsFHXBiZtqXNmntko7b6RR6GAPkNfptkMN6h iD7eF5AMQ6mtZyKZv4in6Kx2aCouL6BRmkG5yc5Ubd+IWttzSu+OiTwYmMuxtcPpgJzy R/7mRGUK3Ko0+BbRIdwR4/18Tj85Wb/iCGV7c= Received: by 10.90.145.11 with SMTP id s11mr5048397agd.88.1299605129949; Tue, 08 Mar 2011 09:25:29 -0800 (PST) Received: from [192.168.0.197] (173-13-177-162-sfba.hfc.comcastbusiness.net [173.13.177.162]) by mx.google.com with ESMTPS id x34sm129270ana.30.2011.03.08.09.25.29 (version=TLSv1/SSLv3 cipher=OTHER); Tue, 08 Mar 2011 09:25:29 -0800 (PST) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1082) Subject: Re: how large can a cluster over the WAN be? From: John Lewis In-Reply-To: Date: Tue, 8 Mar 2011 09:25:27 -0800 Content-Transfer-Encoding: quoted-printable Message-Id: <1796F02B-E02B-4D40-A816-DE147CC67722@gmail.com> References: <23A30D2D-AB34-4CA0-8B59-79BD6D4637CA@gmail.com> To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1082) Thanks for the reply, I realize my question was rather nebulous as I = consider this proposed deployment to be rather nebulous as well. Any bit = of information and a direction on which sections of documentation are = relevant helps this challenge become less nebulous over time. I will do = some reading on the topics you have provided. Thanks again Lewis On Mar 7, 2011, at 11:52 PM, Robert Coli wrote: > On Mon, Mar 7, 2011 at 11:32 AM, John Lewis = wrote: >> When you say decent latency and throughput what numbers do you = consider decent? I know throughput would be highly dependent on the = quantity of kb shoved through the pipe so I would expect throughput = needs would be highly dependent on the data actually in cassandra. >=20 > As you say, throughput needed is dependent on Cassandra payload size, > but also (in 0.7) read repair percentage. Cassandra is a large > consumer of network traffic relative to the amount of data serviced to > clients due to background repair processes like read repair and manual > AES repair. There are obviously scenarios where you might saturate the > WAN link given large enough nodes or numbers of nodes per datacenter.. >=20 > When I am talking about latency, my experience is with WAN latency > under 100ms and without DynamicEndpointSnitch. I suspect that within > an order of magnitude of that latency, with or without DES, is likely > to be fine for many use cases. There are a few tunables which might > be appropriate to increase when operating in more than two datacenters > with greater possible latency between any two as well as replication > strategies and consistency levels which offer certain latency > behavior. As always, simulating your actual workload is likely to give > you the most relevant information as to the impact of inter-cassandra > latency on your application. :) >=20 > =3DRob