Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E8093200D43 for ; Tue, 21 Nov 2017 21:56:04 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id E6726160BFC; Tue, 21 Nov 2017 20:56:04 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 10F9E160BE3 for ; Tue, 21 Nov 2017 21:56:03 +0100 (CET) Received: (qmail 36994 invoked by uid 500); 21 Nov 2017 20:56:03 -0000 Mailing-List: contact dev-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@zookeeper.apache.org Delivered-To: mailing list dev@zookeeper.apache.org Received: (qmail 36983 invoked by uid 99); 21 Nov 2017 20:56:03 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Nov 2017 20:56:03 +0000 Received: from mail-wr0-f173.google.com (mail-wr0-f173.google.com [209.85.128.173]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 239E51A006D for ; Tue, 21 Nov 2017 20:56:01 +0000 (UTC) Received: by mail-wr0-f173.google.com with SMTP id k18so7590502wre.1 for ; Tue, 21 Nov 2017 12:56:01 -0800 (PST) X-Gm-Message-State: AJaThX6dnK+ZYcCpZQib9ukdW2Lt87v8rSB2Ww4KhNE0emK6KPY+P5Mi GdP+VTzCOh0PL6j/ns9+C1BaNz4oH6St09a72Rc= X-Google-Smtp-Source: AGs4zMb6y9qFlmPvr1cpnEP420g22M3n1l/MW6+RkUMH7jJSp5s/cgjfjOewCaiA/ioFHaSFCz6Z3oAyQ1e2UmyM6Tg= X-Received: by 10.223.173.4 with SMTP id p4mr14375472wrc.209.1511297760403; Tue, 21 Nov 2017 12:56:00 -0800 (PST) MIME-Version: 1.0 Received: by 10.28.176.68 with HTTP; Tue, 21 Nov 2017 12:55:19 -0800 (PST) In-Reply-To: <1511295494.896595.1180221048.45AAD198@webmail.messagingengine.com> References: <1511295494.896595.1180221048.45AAD198@webmail.messagingengine.com> From: Patrick Hunt Date: Tue, 21 Nov 2017 12:55:19 -0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Apache build failing with test-core-cppunit To: DevZooKeeper Content-Type: multipart/alternative; boundary="f403045cef0428a905055e846eeb" archived-at: Tue, 21 Nov 2017 20:56:05 -0000 --f403045cef0428a905055e846eeb Content-Type: text/plain; charset="UTF-8" FYI: someone just reported similar problems to the builds list: ---- it seems that on some nodes the user ids, that are used by the Jenkins slav= es, have been changed. But there are still some directories residing in /tm= p with ownership to the old uid. That causes a conflict with our tests, bec= ause these files can neither be deleted nor moved. Slave where our jobs fail: H25 But this may not be the only one. Could you please check and delete (old) temp files there. In our case it's /tmp/archiva, but other projects may have similar problems= On Tue, Nov 21, 2017 at 12:18 PM, Abraham Fine wrote: > I'll take a look. > > On Tue, Nov 21, 2017, at 11:55, Patrick Hunt wrote: > > Looks like someone is creating our test files outside of jenkins. I > > modified the job to output our id and look at the perms on those files: > > > > ---- > > > > [ZooKeeper-trunk] $ /bin/bash /tmp/jenkins291402182647699851.sh > > uid=910(jenkins) gid=910(jenkins) groups=910(jenkins),999(docker) > > > > drwxr-xr-x 3 10025 12036 4096 Nov 10 01:39 /tmp/zkdata > > -rw-r--r-- 1 10025 12036 2 Nov 10 01:39 /tmp/zkdata/myid > > > > /tmp/zkdata/version-2: > > total 20 > > drwxr-xr-x 2 10025 12036 4096 Oct 22 23:35 . > > drwxr-xr-x 3 10025 12036 4096 Nov 10 01:39 .. > > -rw-r--r-- 1 10025 12036 1 Oct 22 23:35 acceptedEpoch > > -rw-r--r-- 1 10025 12036 1 Oct 22 23:35 currentEpoch > > -rw-r--r-- 1 10025 12036 562 Oct 22 23:35 snapshot.0 > > > > ---- > > > > > > Notice that it's not jenkins. > > > > > > Can you (Abe?) submit a jira/patch (ASAP as it's breaking the build) > > to create a new directory in /tmp and then host all the tmp files > > there? > > > > > > Thanks, > > > > > > Patrick > > > > > > > > On Tue, Nov 21, 2017 at 10:37 AM, Patrick Hunt wrote: > > > > > With the same issue? Does it ever pass? > > > > > > Patrick > > > > > > On Tue, Nov 21, 2017 at 10:32 AM, Andor Molnar > wrote: > > > > > >> I checked back a few failing builds and see different hosts failing: > H4, > > >> H9, H12, ... > > >> > > >> > > >> > > >> > > >> > > >> On Tue, Nov 21, 2017 at 6:26 PM, Patrick Hunt > wrote: > > >> > > >> > Could it be an environment issue? I see the following just before > the > > >> > failure: > > >> > > > >> > [exec] rm: cannot remove '/tmp/zkdata/myid': Permission denied > > >> > > > >> > check whether it's happening on just one host (jenkins). > > >> > > > >> > Patrick > > >> > > > >> > On Tue, Nov 21, 2017 at 6:25 AM, Andor Molnar > > >> wrote: > > >> > > > >> > > Looks like only https://builds.apache.org/job/ZooKeeper-trunk is > > >> > affected. > > >> > > > > >> > > > > >> > > On Tue, Nov 21, 2017 at 3:22 PM, Andor Molnar > > > >> > wrote: > > >> > > > > >> > > > Hi, > > >> > > > > > >> > > > Zookeeper build has been failing for a while with some weird > error > > >> in > > >> > > > test-core-cppunit task. In most cases the error is the > following: > > >> > > > > > >> > > > ... > > >> > > > [exec] Zookeeper_simpleSystem::testGetChildren2 : elapsed > > >> 1052 : > > >> > OK > > >> > > > [exec] Zookeeper_simpleSystem::testLastZxid : elapsed > 4520 : > > >> OK > > >> > > > [exec] Zookeeper_simpleSystem::testRemoveWatchers > ZooKeeper > > >> > server > > >> > > > started : elapsed 5390 : OK > > >> > > > [exec] rm: cannot remove '/tmp/zkdata/myid': Permission > denied > > >> > > > [exec] Zookeeper_readOnly::testReadOnly : assertion : > elapsed > > >> > 4018 > > >> > > > [exec] /home/jenkins/jenkins-slave/wo > > >> rkspace/ZooKeeper-trunk/src/ > > >> > > > c/tests/TestReadOnlyClient.cc:99: Assertion: equality assertion > > >> failed > > >> > > > [Expected: 0, Actual : -4] > > >> > > > [exec] Failures !!! > > >> > > > [exec] Run: 74 Failure total: 1 Failures: 1 Errors: 0 > > >> > > > [exec] FAIL: zktest-mt > > >> > > > [exec] ========================================== > > >> > > > [exec] 1 of 2 tests failed > > >> > > > [exec] Please report to user@zookeeper.apache.org > > >> > > > [exec] ========================================== > > >> > > > [exec] Makefile:1744: recipe for target 'check-TESTS' > failed > > >> > > > [exec] make[1]: Leaving directory > '/home/jenkins/jenkins-slave/ > > >> > > > workspace/ZooKeeper-trunk/build/test/test-cppunit' > > >> > > > [exec] Makefile:2000: recipe for target 'check-am' failed > > >> > > > [exec] /home/jenkins/jenkins-slave/wo > > >> rkspace/ZooKeeper-trunk/src/ > > >> > > c/tests/zkServer.sh: > > >> > > > line 62: kill: (10156) - No such process > > >> > > > [exec] make[1]: *** [check-TESTS] Error 1 > > >> > > > [exec] make: *** [check-am] Error 2 > > >> > > > > > >> > > > ---------------------- > > >> > > > > > >> > > > Test at line TestReadOnlyClient.cc:99 got ConnectionLoss event. > > >> > > > Does anyone has a clue what could be the root cause of this? > > >> > > > > > >> > > > Regards, > > >> > > > Andor > > >> > > > > > >> > > > > > >> > > > > >> > > > >> > > > > > > > --f403045cef0428a905055e846eeb--