From: Jahangir Mohammed
To: user@cassandra.apache.org
Date: Mon, 21 Nov 2011 07:26:50 -0500
Subject: Re: Pending ReadStage is exploding on only one node
In-Reply-To: <4ECA2E22.9020801@ic-drei.de>

I am not so sure from version to version.

1. Which client are you using? Any custom load balancer?
2. Is the hardware on this node any different from other nodes?

Thanks,
Jahangir.

On Mon, Nov 21, 2011 at 5:55 AM, Johann Höchtl <h.hoechtl@ic-drei.de> wrote:
Hi all,

I'm experiencing strange behaviour of my 6-node Cassandra cluster and I hope someone can explain what I'm doing wrong.

The setting:
6 Cassandra nodes, version 1.0.3
Random Partitioning
The ColumnFamily in question has a replication factor of 2 and stores products of different shops with a secondary index on shop_id.

Twice a day, I do an update of the data with the following mechanism:
Get all keys of a shop.
Read the new CSV.
Insert the rows from the csv whose keys are not yet present, and delete the rows that are no longer present in the csv.
Update all prices of the products from the csv and set an update_date.
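
A minimal sketch of the key comparison behind the steps above, in plain Python with no Cassandra client calls. The column names `product_id` and `price` and the function `plan_shop_update` are hypothetical and were chosen for illustration only. The thread does not say how the CSV is laid out or which client is used.

```python
import csv
import io
from datetime import datetime, timezone


def plan_shop_update(existing_keys, csv_text,
                     key_field="product_id", price_field="price", now=None):
    """Work out which rows to insert, delete and update for one shop.

    existing_keys: row keys currently stored for the shop.
    csv_text: contents of the new CSV (column names are assumptions).
    """
    now = now or datetime.now(timezone.utc)

    # Index the CSV rows by their key column.
    rows = {row[key_field]: row for row in csv.DictReader(io.StringIO(csv_text))}

    existing = set(existing_keys)
    incoming = set(rows)

    to_insert = sorted(incoming - existing)   # in the CSV, not yet stored
    to_delete = sorted(existing - incoming)   # stored, no longer in the CSV

    # Every product in the CSV gets its price and an update_date set.
    stamp = now.isoformat()
    price_updates = {
        key: {"price": rows[key][price_field], "update_date": stamp}
        for key in sorted(incoming)
    }
    return to_insert, to_delete, price_updates
```

The three results can then be turned into whatever batch of inserts, deletes and column updates the client in use supports.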

I'm measuring a high load value on a few nodes during the update process (which is normal), but one node keeps the high load for a long time after the process has finished.
I checked tpstats and found that this node has over 50k pending ReadStage tasks.
All the other nodes don't have that behaviour.
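
To compare nodes, the pending count can be read out of the `nodetool tpstats` text. The sketch below assumes the usual column order (pool name, active, pending, ...). The sample text is made up for illustration and is not output from this cluster. `pending_tasks` is a hypothetical helper name.

```python
def pending_tasks(tpstats_text, pool="ReadStage"):
    """Return the Pending count for one thread pool, or None if not found.

    Assumes each pool line reads: <name> <active> <pending> ...
    """
    for line in tpstats_text.splitlines():
        fields = line.split()
        if len(fields) >= 3 and fields[0] == pool:
            return int(fields[2])
    return None


# Illustrative sample only; the numbers are invented.
SAMPLE = """\
Pool Name                    Active   Pending      Completed   Blocked  All time blocked
ReadStage                        32     50123         987654         0                 0
MutationStage                     0         0         123456         0                 0
"""

if __name__ == "__main__":
    print(pending_tasks(SAMPLE))
```

Running the same check against the tpstats text from each node makes it easy to see which node is the outlier.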

I already had this problem on Cassandra 0.7, but after upgrading to 0.8 it disappeared. Now it is back.

Any suggestions?

Thanks,
Hans
