Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 43285 invoked from network); 21 Sep 2010 21:10:08 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 21 Sep 2010 21:10:08 -0000 Received: (qmail 67265 invoked by uid 500); 21 Sep 2010 21:10:06 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 67220 invoked by uid 500); 21 Sep 2010 21:10:05 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 67212 invoked by uid 99); 21 Sep 2010 21:10:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Sep 2010 21:10:05 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a40.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Sep 2010 21:10:00 +0000 Received: from homiemail-a40.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a40.g.dreamhost.com (Postfix) with ESMTP id 5D9E874C079 for ; Tue, 21 Sep 2010 14:09:39 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=to:cc:from :subject:date:message-id:content-type:mime-version:in-reply-to; q=dns; s=thelastpickle.com; b=UQJxnT8SLx2KXMC+TDFh3c4zavkzhag5n TzLCkANcxHzssO/aO8RnyVEOrrZ6F2yTLItyfX54iOH1NHeglFlaZy6gi6qGy7Np Q7JwP0R4nqBc76i+Q9SQyD46JARuvLNGo1IdXjCe7QVVq88lwvFt6B0EWC9KFges PLO6Oh3W50= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h=to:cc :from:subject:date:message-id:content-type:mime-version: in-reply-to; s=thelastpickle.com; bh=C/UyKA7LIuzCNhjbTF5Og3bQScs =; b=CjtlEZ1+uo3kyYnRb8BtNi0ExhyqF8ICOJKNs3xi+XGHHy4q7jhXs6mkLnp aWzUEhyzy296lLUHKfe7akyEyPhkmHZ56tv+GfBhtRNxw8FjeDSWI2/I86a1Uulq gmIj3JU5J4IMf2iv93CHIbioZWlJcu0ViqD/CG8PqUgAFkLQ= Received: from localhost (webms.mac.com [17.148.16.116]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a40.g.dreamhost.com (Postfix) with ESMTPSA id 4257774C072; Tue, 21 Sep 2010 14:09:39 -0700 (PDT) To: user@cassandra.apache.org Cc: user@cassandra.apache.org From: Aaron Morton Subject: Re: Cassandra operation success ratio survey results Date: Tue, 21 Sep 2010 21:09:38 GMT X-Mailer: MobileMe Mail (1C3203) Message-id: <2780cb82-a638-ee2c-839d-d295b5406a6e@me.com> Content-Type: multipart/alternative; boundary=Apple-Webmail-42--d05cec93-f7e8-4ebc-9709-b8734e79bbb6 MIME-Version: 1.0 In-Reply-To: --Apple-Webmail-42--d05cec93-f7e8-4ebc-9709-b8734e79bbb6 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=ISO-8859-1; format=flowed Thanks for this, really interesting stuff.=A0=0A=0AJust to make sure I'm=A0= understanding=A0it, this is for PHP clients witha =A01 second timeout and = retry is to a different=A0=0Anode in the cluster with the same timeout.=A0= =0A=0AHave you enabled the Dynamic Snitch ?=A0http://www.riptano.com/blog/= whats-new-cassandra-065=0A=0AAaron=0A=0AOn 22 Sep, 2010,at 01:29 AM, Juho = M=E4kinen wrote:=0A=0AIt's known that compaction = hurts the node performance so that it might=0Amiss some requests. That's w= hy it's important to handle these=0Asituations and the client needs to ret= ry the operation into another=0Aworking host. We have been storing perform= ance data from each=0Acassandra request which we do into our five node cas= sandra production=0Acluster.=0A=0AWe log the retry count and request type = into our data warehouse=0Asolution and I've now extracted the data from a = 10 day period and=0Acalculated how many retry requests is needed so that t= he results can=0Abe obtained. The following chart tells how many time an o= peration had=0Ato be retried until it was successfully completed. The perc= ents tells=0Athe probability like that "the request will be successful wit= h the=0Afirst try in 99.933 % times."=0A=0ATotal amount of operations: 94 = 682 251 within 10 days.=0A=0ARetry times | operations | percentage from to= tal operations=0A0 | 94618468 | 99.93263 %=0A1 | 56688 | 0.05987 %=0A2 | 5= 018 | 0.00529 %=0A3 | 1359 | 0.00144 %=0A4 | 111 | 0.00012 %=0A5 | 25 | 0.= 00003 %=0A=0AThere were also few operations which needed more than five re= tries, so=0Apreparing to try up to ten times is not a bad idea.=0A=0AThe c= luster users 0.6.5 with RF=3D3. Each operation is executed until it=0Asucc= eeds or until 10 retries using this php wrapper=0Ahttp://github.com/dynamo= id/cassandra-utilities=0A=0AHave others found similar results? Please disc= uss :)=0A=0A- Juho M=E4kinen=0A --Apple-Webmail-42--d05cec93-f7e8-4ebc-9709-b8734e79bbb6 Content-Type: multipart/related; type="text/html"; boundary=Apple-Webmail-86--d05cec93-f7e8-4ebc-9709-b8734e79bbb6 --Apple-Webmail-86--d05cec93-f7e8-4ebc-9709-b8734e79bbb6 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=ISO-8859-1;
Thanks for this, really interesting stuff. 

=
Just to make sure I'm understanding it, this is for PHP cli= ents witha  1 second timeout and retry is to a different 
<= div>node in the cluster with the same timeout. 

<= div>Have you enabled the Dynamic Snitch ? http://www.riptano.com/blog/whats-new= -cassandra-065

Aaron

= On 22 Sep, 2010,at 01:29 AM, Juho M=E4kinen <juho.makinen@gmail.com>= wrote:

It's known that compaction hurts the node performance so that it mi= ght
=0Amiss some requests. That's why it's important to handle these=0Asituations and the client needs to retry the operation into another=0Aworking host. We have been storing performance data from each
=0Aca= ssandra request which we do into our five node cassandra production
=0A= cluster.
=0A
=0AWe log the retry count and request type into our dat= a warehouse
=0Asolution and I've now extracted the data from a 10 day p= eriod and
=0Acalculated how many retry requests is needed so that the r= esults can
=0Abe obtained. The following chart tells how many time an o= peration had
=0Ato be retried until it was successfully completed. The = percents tells
=0Athe probability like that "the request will be succes= sful with the
=0Afirst try in 99.933 % times."
=0A
=0ATotal amoun= t of operations: 94 682 251 within 10 days.
=0A
=0ARetry times | ope= rations | percentage from total operations
=0A 0 | 94618468 = | 99.93263 %
=0A 1 | 56688 | 0.05987 %
=0A 2= | 5018 | 0.00529 %
=0A 3 | 1359 | 0.00144 %
= =0A 4 | 111 | 0.00012 %
=0A 5 | 25 | = 0.00003 %
=0A
=0AThere were also few operations which needed more t= han five retries, so
=0Apreparing to try up to ten times is not a bad i= dea.
=0A
=0AThe cluster users 0.6.5 with RF=3D3. Each operation is e= xecuted until it
=0Asucceeds or until 10 retries using this php wrapper=
=0Ahttp://github.com/dyn= amoid/cassandra-utilities
=0A
=0AHave others found similar resul= ts? Please discuss :)
=0A
=0A - Juho M=E4kinen
=0A
--Apple-Webmail-86--d05cec93-f7e8-4ebc-9709-b8734e79bbb6-- --Apple-Webmail-42--d05cec93-f7e8-4ebc-9709-b8734e79bbb6--