Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6D851193EE for ; Thu, 14 Apr 2016 23:28:41 +0000 (UTC) Received: (qmail 58112 invoked by uid 500); 14 Apr 2016 23:28:36 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 58067 invoked by uid 500); 14 Apr 2016 23:28:36 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 58057 invoked by uid 99); 14 Apr 2016 23:28:36 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 14 Apr 2016 23:28:36 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 085B4C0C72 for ; Thu, 14 Apr 2016 23:28:36 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.198 X-Spam-Level: * X-Spam-Status: No, score=1.198 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id jkVSHzCpugDb for ; Thu, 14 Apr 2016 23:28:33 +0000 (UTC) Received: from mail-io0-f169.google.com (mail-io0-f169.google.com [209.85.223.169]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id CB5705F23E for ; Thu, 14 Apr 2016 23:28:32 +0000 (UTC) Received: by mail-io0-f169.google.com with SMTP id 2so119622906ioy.1 for ; Thu, 14 Apr 2016 16:28:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to; bh=Zf80eBpryqA8JsUE2YGvg29Oftcecj7or0uiOVx6/WQ=; b=WFt8Sdjbn1aTch78FyphkcwalhqculIpQPde0QUtTOF60kSDdyWkwY0UG1w8wQTaeJ +7J9cfJ39ZGBh8YVQlC2rTwBI1mKRiumrFiizjh0/o2m8b8am+1hW0SIufyHlPaEMsM0 M+GV5sLe1VEWQoZU4esDjtDEnZHFmXde+cc9PULMH2CrH+9RPNCttbB06qKZQpVkdnkG /RtzuHY6QJYXI3IGPGaHQuqjW89NyGlFs7LGf3ltyFyxSqwtGxbOoZDvzc4e3/3lvvmR UHVqJk+mpZ3JsGz8b2YU0vJHe3/GoEFJ3WRTXBE+S0rNgHeRBdtlaJ89YXCHtOuF2MUW FzEw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to; bh=Zf80eBpryqA8JsUE2YGvg29Oftcecj7or0uiOVx6/WQ=; b=Io4GgjqcfCIilZp4pALTVms4H7I+b4g59f1MLJMCfXWWJVL9Jolp5x4amZhwT5ykeN AP7H8QD8qEjnmRdGxSqGjZ9lBx+2Y179MYIxj9o0U3qDI4WZ0Jsp4mWqodUE5AvVDXrv 4NRvvX5eOlYlYJEM0+Qp1NKChy9uoh9P7sLd4LzR3vv6XLy6tsSzhjnwZyTRe7xviYZs 8Gj5h2NUeLddSh4eewZ0WDSrwXE2DtZ5N3kbGzEmY6wHjSXQGDeloH96CPKgzx+nqEm2 9DPRi1cRF3Evk98mRrQY3zSYO0UxMBjUGJDoxtxIF922vnE7hK4SyW6a3L0j3kJ+hqSE SGzg== X-Gm-Message-State: AOPr4FWI/+NOXSPBB4DxwjFMm2BVUbXBAIpkETqDga4MLgL2xJqGqPvum/kRNXzZvS0vwyttXI8e5AzEeDs32g== MIME-Version: 1.0 X-Received: by 10.107.29.132 with SMTP id d126mr19522867iod.14.1460676506219; Thu, 14 Apr 2016 16:28:26 -0700 (PDT) Received: by 10.79.89.67 with HTTP; Thu, 14 Apr 2016 16:28:26 -0700 (PDT) In-Reply-To: References: Date: Thu, 14 Apr 2016 16:28:26 -0700 Message-ID: Subject: Fwd: Cassandra Load spike From: kavya To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=001a113ff5a249108605307a40a3 --001a113ff5a249108605307a40a3 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi, We are running a 6 node cassandra 2.2.4 cluster and we are seeing a spike in the disk Load as per the =E2=80=98nodetool status=E2=80=99 command that = does not correspond with the actual disk usage. Load reported by nodetool was as high as 3 times actual disk usage on certain nodes. We noticed that the periodic repair failed with below error on running the command : =E2=80=99nodetool repair -pr=E2=80=99 ERROR [RepairJobTask:2] 2016-04-12 15:46:29,902 RepairRunnable.java:243 - Repair session 64b54d50-0100-11e6-b46e-a511fd37b526 for range (-3814318684016904396,-3810689996127667017] failed with error [=E2=80=A6.] Validation failed in / org.apache.cassandra.exceptions.RepairException: [=E2=80=A6.] Validation fa= iled in at org.apache.cassandra.repair.ValidationTask.treeReceived(ValidationTask.java= :64) ~[apache-cassandra-2.2.4.jar:2.2.4] at org.apache.cassandra.repair.RepairSession.validationComplete(RepairSession.= java:183) ~[apache-cassandra-2.2.4.jar:2.2.4] at org.apache.cassandra.service.ActiveRepairService.handleMessage(ActiveRepair= Service.java:410) ~[apache-cassandra-2.2.4.jar:2.2.4] at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVe= rbHandler.java:163) ~[apache-cassandra-2.2.4.jar:2.2.4] at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:6= 7) ~[apache-cassandra-2.2.4.jar:2.2.4] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1= 142) [na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:= 617) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40 We restarted all nodes in the cluster and ran a full repair which completed successfully without any validation errors, however we still see Load spike on the same nodes after a while. Please advice. Thanks! --001a113ff5a249108605307a40a3 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi,
We are running a 6 node cassandra 2.2.4 cluster and we are seeing a spike= in the disk Load as per the =E2=80=98nodetool status=E2=80=99 command that= does not correspond with the actual disk usage. Load reported by nodetool = was as high as 3 times actual disk usage on certain nodes.
We noticed th= at the periodic repair failed with below error on running the command : =E2= =80=99nodetool repair -pr=E2=80=99

ERROR [RepairJobTask:2] 2016-04-1= 2 15:46:29,902 RepairRunnable.java:243 - Repair session 64b54d50-0100-11e6-= b46e-a511fd37b526 for range (-3814318684016904396,-3810689996127667017] fai= led with error [=E2=80=A6.] Validation failed in /<ip>
org.apache.= cassandra.exceptions.RepairException: [=E2=80=A6.] Validation failed in <= ;ip>
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.repair.ValidationTask= .treeReceived(ValidationTask.java:64) ~[apache-cassandra-2.2.4.jar:2.2.4]=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.repair.RepairSession.validatio= nComplete(RepairSession.java:183) ~[apache-cassandra-2.2.4.jar:2.2.4]
= =C2=A0=C2=A0=C2=A0 at org.apache.cassandra.service.ActiveRepairService.hand= leMessage(ActiveRepairService.java:410) ~[apache-cassandra-2.2.4.jar:2.2.4]=
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.repair.RepairMessageVerbHand= ler.doVerb(RepairMessageVerbHandler.java:163) ~[apache-cassandra-2.2.4.jar:= 2.2.4]
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.net.MessageDeliveryTas= k.run(MessageDeliveryTask.java:67) ~[apache-cassandra-2.2.4.jar:2.2.4]
= =C2=A0=C2=A0=C2=A0 at java.util.concurrent.ThreadPoolExecutor.runWorker(Thr= eadPoolExecutor.java:1142) [na:1.8.0_40]
=C2=A0=C2=A0=C2=A0 at java.util= .concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:= 1.8.0_40]
=C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745) [n= a:1.8.0_40

We restarted all nodes in the cluster and ran a full rep= air which completed successfully without any validation errors, however we = still see Load spike on the same nodes after a while. Please advice.
Thanks!

--001a113ff5a249108605307a40a3--