zookeeper-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Patrick Hunt <ph...@apache.org>
Subject Re: Apache build failing with test-core-cppunit
Date Tue, 21 Nov 2017 20:55:19 GMT
FYI: someone just reported similar problems to the builds list:

----
it seems that on some nodes the user ids, that are used by the Jenkins slav=
es, have been changed. But there are still some directories residing in /tm=
p with ownership to the old uid. That causes a conflict with our tests, bec=
ause these files can neither be deleted nor moved.

Slave where our jobs fail: H25
But this may not be the only one.

Could you please check and delete (old) temp files there.
In our case it's /tmp/archiva, but other projects may have similar problems=


On Tue, Nov 21, 2017 at 12:18 PM, Abraham Fine <afine@apache.org> wrote:

> I'll take a look.
>
> On Tue, Nov 21, 2017, at 11:55, Patrick Hunt wrote:
> > Looks like someone is creating our test files outside of jenkins. I
> > modified the job to output our id and look at the perms on those files:
> >
> > ----
> >
> > [ZooKeeper-trunk] $ /bin/bash /tmp/jenkins291402182647699851.sh
> > uid=910(jenkins) gid=910(jenkins) groups=910(jenkins),999(docker)
> >
> > drwxr-xr-x 3 10025 12036 4096 Nov 10 01:39 /tmp/zkdata
> > -rw-r--r-- 1 10025 12036    2 Nov 10 01:39 /tmp/zkdata/myid
> >
> > /tmp/zkdata/version-2:
> > total 20
> > drwxr-xr-x 2 10025 12036 4096 Oct 22 23:35 .
> > drwxr-xr-x 3 10025 12036 4096 Nov 10 01:39 ..
> > -rw-r--r-- 1 10025 12036    1 Oct 22 23:35 acceptedEpoch
> > -rw-r--r-- 1 10025 12036    1 Oct 22 23:35 currentEpoch
> > -rw-r--r-- 1 10025 12036  562 Oct 22 23:35 snapshot.0
> >
> > ----
> >
> >
> > Notice that it's not jenkins.
> >
> >
> > Can you (Abe?) submit a jira/patch (ASAP as it's breaking the build)
> > to create a new directory in /tmp and then host all the tmp files
> > there?
> >
> >
> > Thanks,
> >
> >
> > Patrick
> >
> >
> >
> > On Tue, Nov 21, 2017 at 10:37 AM, Patrick Hunt <phunt@apache.org> wrote:
> >
> > > With the same issue? Does it ever pass?
> > >
> > > Patrick
> > >
> > > On Tue, Nov 21, 2017 at 10:32 AM, Andor Molnar <andor@cloudera.com>
> wrote:
> > >
> > >> I checked back a few failing builds and see different hosts failing:
> H4,
> > >> H9, H12, ...
> > >>
> > >>
> > >>
> > >>
> > >>
> > >> On Tue, Nov 21, 2017 at 6:26 PM, Patrick Hunt <phunt@apache.org>
> wrote:
> > >>
> > >> > Could it be an environment issue? I see the following just before
> the
> > >> > failure:
> > >> >
> > >> >      [exec] rm: cannot remove '/tmp/zkdata/myid': Permission denied
> > >> >
> > >> > check whether it's happening on just one host (jenkins).
> > >> >
> > >> > Patrick
> > >> >
> > >> > On Tue, Nov 21, 2017 at 6:25 AM, Andor Molnar <andor@cloudera.com>
> > >> wrote:
> > >> >
> > >> > > Looks like only https://builds.apache.org/job/ZooKeeper-trunk
is
> > >> > affected.
> > >> > >
> > >> > >
> > >> > > On Tue, Nov 21, 2017 at 3:22 PM, Andor Molnar <andor@cloudera.com
> >
> > >> > wrote:
> > >> > >
> > >> > > > Hi,
> > >> > > >
> > >> > > > Zookeeper build has been failing for a while with some weird
> error
> > >> in
> > >> > > > test-core-cppunit task. In most cases the error is the
> following:
> > >> > > >
> > >> > > > ...
> > >> > > >      [exec] Zookeeper_simpleSystem::testGetChildren2 : elapsed
> > >> 1052 :
> > >> > OK
> > >> > > >      [exec] Zookeeper_simpleSystem::testLastZxid : elapsed
> 4520 :
> > >> OK
> > >> > > >      [exec] Zookeeper_simpleSystem::testRemoveWatchers
> ZooKeeper
> > >> > server
> > >> > > > started : elapsed 5390 : OK
> > >> > > >      [exec] rm: cannot remove '/tmp/zkdata/myid': Permission
> denied
> > >> > > >      [exec] Zookeeper_readOnly::testReadOnly : assertion
:
> elapsed
> > >> > 4018
> > >> > > >      [exec] /home/jenkins/jenkins-slave/wo
> > >> rkspace/ZooKeeper-trunk/src/
> > >> > > > c/tests/TestReadOnlyClient.cc:99: Assertion: equality assertion
> > >> failed
> > >> > > > [Expected: 0, Actual  : -4]
> > >> > > >      [exec] Failures !!!
> > >> > > >      [exec] Run: 74   Failure total: 1   Failures: 1   Errors:
0
> > >> > > >      [exec] FAIL: zktest-mt
> > >> > > >      [exec] ==========================================
> > >> > > >      [exec] 1 of 2 tests failed
> > >> > > >      [exec] Please report to user@zookeeper.apache.org
> > >> > > >      [exec] ==========================================
> > >> > > >      [exec] Makefile:1744: recipe for target 'check-TESTS'
> failed
> > >> > > >      [exec] make[1]: Leaving directory
> '/home/jenkins/jenkins-slave/
> > >> > > > workspace/ZooKeeper-trunk/build/test/test-cppunit'
> > >> > > >      [exec] Makefile:2000: recipe for target 'check-am'
failed
> > >> > > >      [exec] /home/jenkins/jenkins-slave/wo
> > >> rkspace/ZooKeeper-trunk/src/
> > >> > > c/tests/zkServer.sh:
> > >> > > > line 62: kill: (10156) - No such process
> > >> > > >      [exec] make[1]: *** [check-TESTS] Error 1
> > >> > > >      [exec] make: *** [check-am] Error 2
> > >> > > >
> > >> > > > ----------------------
> > >> > > >
> > >> > > > Test at line TestReadOnlyClient.cc:99 got ConnectionLoss
event.
> > >> > > > Does anyone has a clue what could be the root cause of this?
> > >> > > >
> > >> > > > Regards,
> > >> > > > Andor
> > >> > > >
> > >> > > >
> > >> > >
> > >> >
> > >>
> > >
> > >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message