Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 62992 invoked from network); 9 Apr 2010 14:26:39 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 9 Apr 2010 14:26:39 -0000 Received: (qmail 72007 invoked by uid 500); 9 Apr 2010 14:26:38 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 71983 invoked by uid 500); 9 Apr 2010 14:26:38 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 71975 invoked by uid 99); 9 Apr 2010 14:26:38 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Apr 2010 14:26:38 +0000 X-ASF-Spam-Status: No, hits=0.5 required=10.0 tests=AWL,RCVD_NUMERIC_HELO,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of gcdcu-cassandra-user-1@m.gmane.org designates 80.91.229.12 as permitted sender) Received: from [80.91.229.12] (HELO lo.gmane.org) (80.91.229.12) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Apr 2010 14:26:32 +0000 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1O0F9u-0004Px-5E for user@cassandra.apache.org; Fri, 09 Apr 2010 16:26:10 +0200 Received: from 38.98.147.130 ([38.98.147.130]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Fri, 09 Apr 2010 16:26:10 +0200 Received: from tzz by 38.98.147.130 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Fri, 09 Apr 2010 16:26:10 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: user@cassandra.apache.org From: Ted Zlatanov Subject: Re: writes to Cassandra failing occasionally Date: Fri, 09 Apr 2010 09:25:57 -0500 Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos Lines: 20 Message-ID: <87wrwgr9ca.fsf@lifelogs.com> References: <4BBCE8CE.9000805@gmail.com> <87k4sit7nj.fsf@lifelogs.com> <4BBE257E.3070207@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 38.98.147.130 X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6;d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:3c4pXZTC6AHSU6htVO0QEg4FeJc= On Thu, 08 Apr 2010 11:50:38 -0700 Mike Gallamore wrote: MG> Yes I agree single threaded is probably not the best. I wonder how MG> much of a performance hit it is on a single CPU machine though? I MG> guess I still would be blocking on ram writes but isn't like there is MG> multiple CPUs I need to keep busy or anything. Cassandra may have to load data from disk for a particular query but another may already be in memory. A third may cause a hit on another cluster node. So if you issue queries serially you'll see performance drop off with the total number of queries because they are dependent on each other's performance, while the distribution of the performance of independent parallel queries will have skew and kurtosis much closer to a normal distribution. In other words, your slowest (or unluckiest) queries are less damaging when you issue them in parallel. On the client side you still have slow serialization/deserialization and not much can be done about that. Ted