From: Hannu Kröger
Subject: Repair fails for unknown reason
Date: Wed, 3 Jan 2018 18:23:11 +0200
To: user@cassandra.apache.org

Hello,

The situation is as follows: a repair was started on node X on this keyspace with --full -pr. The repair fails on node Y.

Node Y has debug logging on (DEBUG on org.apache.cassandra) and I’m looking at the debug.log. I see the following messages related to this repair request:

-----------
DEBUG [AntiEntropyStage:1] 2018-01-02 17:52:12,530 RepairMessageVerbHandler.java:114 - Validating ValidationRequest{gcBefore=1511473932} org.apache.cassandra.repair.messages.ValidationRequest@5a17430c
DEBUG [ValidationExecutor:4] 2018-01-02 17:52:12,531 StorageService.java:3321 - Forcing flush on keyspace mykeyspace, CF mytable
DEBUG [MemtablePostFlush:54] 2018-01-02 17:52:12,531 ColumnFamilyStore.java:954 - forceFlush requested but everything is clean in mytable
ERROR [ValidationExecutor:4] 2018-01-02 17:52:12,532 Validator.java:268 - Failed creating a merkle tree for [repair #1df000a0-effa-11e7-8361-b7c9edfbfc33 on mykeyspace/mytable, [(6917529027641081856,-9223372036854775808]]], /123.123.123.123 (see log for details)
-----------

Then the same about another table, and after that the following, which indicates that the repair “master” has basically told the node to abort, right?

-----------
DEBUG [AntiEntropyStage:1] 2018-01-02 17:52:12,563 RepairMessageVerbHandler.java:142 - Got anticompaction request AnticompactionRequest{parentRepairSession=1de949e0-effa-11e7-8361-b7c9edfbfc33} org.apache.cassandra.repair.messages.AnticompactionRequest@5dc8beea
ERROR [AntiEntropyStage:1] 2018-01-02 17:52:12,563 RepairMessageVerbHandler.java:168 - Got error, removing parent repair session
ERROR [AntiEntropyStage:1] 2018-01-02 17:52:12,564 CassandraDaemon.java:228 - Exception in thread Thread[AntiEntropyStage:1,5,main]
java.lang.RuntimeException: java.lang.RuntimeException: Parent repair session with id = 1de949e0-effa-11e7-8361-b7c9edfbfc33 has failed.
	at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVerbHandler.java:171) ~[apache-cassandra-3.11.0.jar:3.11.0]
	at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:66) ~[apache-cassandra-3.11.0.jar:3.11.0]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[na:1.8.0_111]
	at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_111]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_111]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_111]
	at org.apache.cassandra.concurrent.NamedThreadFactory.lambda$threadLocalDeallocator$0(NamedThreadFactory.java:81) [apache-cassandra-3.11.0.jar:3.11.0]
	at java.lang.Thread.run(Thread.java:745) ~[na:1.8.0_111]
Caused by: java.lang.RuntimeException: Parent repair session with id = 1de949e0-effa-11e7-8361-b7c9edfbfc33 has failed.
	at org.apache.cassandra.service.ActiveRepairService.getParentRepairSession(ActiveRepairService.java:409) ~[apache-cassandra-3.11.0.jar:3.11.0]
	at org.apache.cassandra.service.ActiveRepairService.doAntiCompaction(ActiveRepairService.java:444) ~[apache-cassandra-3.11.0.jar:3.11.0]
	at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVerbHandler.java:143) ~[apache-cassandra-3.11.0.jar:3.11.0]
	... 7 common frames omitted
-----------

But that is almost all there is in the log, and I don’t really see what the original problem is.

Cassandra flushes the table to start building the merkle tree, and in the next millisecond it already fails the repair, without a proper exception or error logging about the problem.

The Cassandra version is 3.11.0.

Any ideas?

Cheers,
Hannu
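
The two excerpts above carry different ids: the validation error names repair #1df000a0-effa-11e7-8361-b7c9edfbfc33, while the anticompaction request names parent session 1de949e0-effa-11e7-8361-b7c9edfbfc33. To check whether anything else in debug.log mentions either id, something like the following could be used. This is only a rough sketch: the function names are made up for illustration, and it assumes the ids always appear in the log in the UUID form shown above.

```python
import re
from typing import Iterable, List

# UUID-shaped ids, as seen in the excerpts above (assumed format).
SESSION_ID = re.compile(
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"
)


def session_ids(lines: Iterable[str]) -> List[str]:
    """Collect distinct UUID-shaped ids in order of first appearance."""
    seen: List[str] = []
    for line in lines:
        for match in SESSION_ID.findall(line):
            if match not in seen:
                seen.append(match)
    return seen


def lines_for_session(lines: Iterable[str], session_id: str) -> List[str]:
    """Return every line that mentions the given id, without the newline."""
    return [line.rstrip("\n") for line in lines if session_id in line]


if __name__ == "__main__":
    # Shortened stand-ins for real log lines; a real run would read debug.log.
    sample = [
        "ERROR Validator.java:268 - Failed creating a merkle tree for "
        "[repair #1df000a0-effa-11e7-8361-b7c9edfbfc33 on mykeyspace/mytable\n",
        "DEBUG Got anticompaction request AnticompactionRequest"
        "{parentRepairSession=1de949e0-effa-11e7-8361-b7c9edfbfc33}\n",
        "DEBUG forceFlush requested but everything is clean in mytable\n",
    ]
    for sid in session_ids(sample):
        print(sid, len(lines_for_session(sample, sid)))
```

For a real log, the list could be replaced with an open file handle, e.g. passing `open("debug.log")` to both functions (reopening or rewinding the file between calls).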