Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 6258 invoked from network); 11 May 2010 18:32:31 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 11 May 2010 18:32:31 -0000 Received: (qmail 78853 invoked by uid 500); 11 May 2010 18:32:30 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 78829 invoked by uid 500); 11 May 2010 18:32:30 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 78821 invoked by uid 99); 11 May 2010 18:32:30 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 May 2010 18:32:30 +0000 X-ASF-Spam-Status: No, hits=3.1 required=10.0 tests=AWL,FREEMAIL_FROM,FS_REPLICA,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of greenemj@gmail.com designates 209.85.160.44 as permitted sender) Received: from [209.85.160.44] (HELO mail-pw0-f44.google.com) (209.85.160.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 May 2010 18:32:25 +0000 Received: by pwj2 with SMTP id 2so2702752pwj.31 for ; Tue, 11 May 2010 11:32:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type; bh=rL3uFIk8CnnQQdDY5N0dIk4JYuzNtZmTuwyRda2Tr9Y=; b=OUSfd76Q0fSLZN16DVA2/2ac712l5N8GxQMA+5KxaBi1iMKpMkKuMTJSd/8VY3P3tD Cwar9yE4pR69cU0NxhNciBit/RvbVvZYYFvUcF7XX77AQgo+qEIf1kT2U1mUNzvMfVIR ITrahx16SigfkkTmzfbNCLmZWS3FgizEfQM9c= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; b=DlFJ0xl3WjGyGzCyRCI9/l9kYBQhbQ0b0/HSYFqjAYcwXcT+rK55tyL6Y2SMKPUtNB DXxAj6iZfRC9qmwinJsjetDpANE4rx6G6iutAGMn7hUYrCrbB+VkjZZhZQEGFyYjdsRf LrAtBynkQGpmHxuC1BFZpgVBmAgDyH+ZUZ+Qc= Received: by 10.141.100.17 with SMTP id c17mr4132912rvm.0.1273602724768; Tue, 11 May 2010 11:32:04 -0700 (PDT) MIME-Version: 1.0 Received: by 10.140.248.17 with HTTP; Tue, 11 May 2010 11:31:27 -0700 (PDT) In-Reply-To: <4BE9815D.1060208@dehora.net> References: <4BE9815D.1060208@dehora.net> From: Mark Greene Date: Tue, 11 May 2010 14:31:27 -0400 Message-ID: Subject: Re: replication impact on write throughput To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=000e0cd13886feacd2048655bd40 --000e0cd13886feacd2048655bd40 Content-Type: text/plain; charset=ISO-8859-1 If you have for example, your replication factor equal to the total amount of nodes in the ring, I suspect you will hit a brick wall pretty soon. The biggest impact on your write performance will most likely be the consistency level of your writes. In other words, how many nodes you want to wait for before you acknowledge the write back to the client. On Tue, May 11, 2010 at 12:10 PM, Bill de hOra wrote: > If I had 10 Cassandra nodes each with a write capacity of 5K per second and > a replication factor of 2, would that mean the expected write capacity of > the system would be ~25K writes per second because the nodes are also > serving other nodes and not just clients? > > I know this is highly simplified take on things (ie no consideration for > reads or quorum), I'm just trying to understand what the implication of > replication is on write scalability. Intuitively it would seem actual write > capacity is total write capacity divided by the replication factor. > > Bill > --000e0cd13886feacd2048655bd40 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable If you have for example, your replication factor equal to the total amount = of nodes in the ring, I suspect you will hit a brick wall pretty soon.
=
The biggest impact on your write performance will most likel= y be the consistency level of your writes. In other words, how many nodes y= ou want to wait for before you acknowledge the write back to the client.



On Tue, May 11, 2010= at 12:10 PM, Bill de hOra <bill@dehora.net> wrote:
If I had 10 Cassandra nodes each with a write capacity of 5K per second and= a replication factor of 2, would that mean the expected write capacity of = the system would be ~25K writes per second because the nodes are also servi= ng other nodes and not just clients?

I know this is highly simplified take on things (ie no consideration for re= ads or quorum), I'm just trying to understand what the implication of r= eplication is on write scalability. Intuitively it would seem actual write = capacity is total write capacity divided by the replication factor.

Bill

--000e0cd13886feacd2048655bd40--