Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 74070 invoked from network); 12 May 2010 12:46:29 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 12 May 2010 12:46:29 -0000 Received: (qmail 50673 invoked by uid 500); 12 May 2010 12:46:28 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 50637 invoked by uid 500); 12 May 2010 12:46:28 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 50629 invoked by uid 99); 12 May 2010 12:46:28 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 May 2010 12:46:28 +0000 X-ASF-Spam-Status: No, hits=4.5 required=10.0 tests=AWL,FREEMAIL_FROM,FS_REPLICA,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of greenemj@gmail.com designates 74.125.83.172 as permitted sender) Received: from [74.125.83.172] (HELO mail-pv0-f172.google.com) (74.125.83.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 May 2010 12:46:22 +0000 Received: by pvb32 with SMTP id 32so24458pvb.31 for ; Wed, 12 May 2010 05:46:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type; bh=WjGpEfizVi/TI+G7ks0jKqRdLk5c20ghMx8aN50lXKs=; b=b4qqllRqmzPxtqizvWhGTmr11vHkJOjP5lwIR+3Wv6T2RdxrENNMNL1FYceIMJTK/j v70Ny/nif9bViyqLyIEraCVERWS3HPYfbU7aYBytVTZFnoJoNaswwuZQNafwlbKIZnrj e/vZc6XWOyHiK8m772YyEUMDFs1LNeiQUOygo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; b=UOcwQdpdtn937QB+DGeaVikUJoKTtLJuHVq5GJ9WvJFcnu8vA6WxoHO8E0mOIU6e/+ ugDDJn/I8jHjxapZJdY1zmpT1zGC/ohLLTDbWG9cpbGSGT2pnNowbwCzJKeegeZVSE/J MOPYMwsJ2acr1kUAY/bbOwoF1Nlm3BaJIjzRw= Received: by 10.141.88.12 with SMTP id q12mr4979032rvl.188.1273668362176; Wed, 12 May 2010 05:46:02 -0700 (PDT) MIME-Version: 1.0 Received: by 10.140.248.17 with HTTP; Wed, 12 May 2010 05:45:42 -0700 (PDT) In-Reply-To: References: <4BE9815D.1060208@dehora.net> <4BE9AB17.3040801@dehora.net> From: Mark Greene Date: Wed, 12 May 2010 08:45:42 -0400 Message-ID: Subject: Re: replication impact on write throughput To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=000e0cd13be04a0a870486650665 --000e0cd13be04a0a870486650665 Content-Type: text/plain; charset=ISO-8859-1 >>If the replication factor is 2, then everything is written twice. So >>your throughput is cut in half. throughput of new inserts is cut in half right? I think I was thinking about capacity in more general terms from the node's perspective. The node has the ability to write so many operations per second whether that be a replicated key or not and you could potentially take that number and multiply it by the number of nodes in the ring to give you your write capacity per second. On Tue, May 11, 2010 at 10:13 PM, Paul Prescod wrote: > On Tue, May 11, 2010 at 5:56 PM, Mark Greene wrote: > > I was under the impression from what I've seen talked about on this list > > (perhaps I'm wrong here) that given the write throughput of one node in a > > cluster (again assuming each node has a given throughput and the same > > config) that you would simply multiply that throughput by the number of > > nodes you had, giving you the total throughput for the entire ring (like > you > > said this is somewhat artificial). The main benefit being that adding > > capacity was as simple as adding more nodes to the ring with no > degradation. > > So by your math, 100 nodes with each node getting 5k wps, I would assume > the > > total capacity is 500k wps. But perhaps I've misunderstood some key > > concepts. Still a novice myself ;-) > > If the replication factor is 2, then everything is written twice. So > your throughput is cut in half. > > Paul Prescod > --000e0cd13be04a0a870486650665 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable

>>If the replication factor is 2, then everything is w= ritten twice. So
>>your throughput is cut in half.

<= div>throughput of new inserts is cut in half right? I think I was thinking = about capacity in more general terms from the node's perspective. The n= ode has the ability to write so many operations per second whether that be = a replicated key or not and you could potentially take that number and mult= iply it by the number of nodes in the ring to give you your write capacity = per second.=A0

On Tue, May 11, 2010 at 10:13 PM, Paul Presc= od <paul@prescod.n= et> wrote:
On Tue, May 11, 2010 at 5:56 PM, Mark Greene <greenemj@gmail.com> wrote:
> I was under the impression from what I've seen talked about on thi= s list
> (perhaps I'm wrong here) that given the write throughput of one no= de in a
> cluster (again assuming each node has a given throughput and the same<= br> > config) that you would simply multiply that throughput by the number o= f
> nodes you had, giving you the total throughput for the entire ring (li= ke you
> said this is somewhat=A0artificial). The main benefit being that addin= g
> capacity was as simple as adding more nodes to the ring with no degrad= ation.
> So by your math, 100 nodes with each node getting 5k wps, I would assu= me the
> total capacity is 500k wps. But perhaps I've misunderstood some ke= y
> concepts. Still a novice myself ;-)

If the replication factor is 2, then everything is written twice. So<= br> your throughput is cut in half.

=A0Paul Prescod

--000e0cd13be04a0a870486650665--