infra-issues mailing list archives

From "Tibor Digana (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (INFRA-16951) Unstable Jenkins. Aborted build. No disk space. Lost connection between slave and master.
Date Sun, 16 Sep 2018 21:57:00 GMT

    [ https://issues.apache.org/jira/browse/INFRA-16951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16616911#comment-16616911
] 

Tibor Digana commented on INFRA-16951:
--------------------------------------

I do not want to be annoying, but we have new findings.
The system clock on the Windows-2016-1 machine stopped while several integration tests were running.
See the timestamp {{02:40:46.186}} in the log:
https://builds.apache.org/job/maven-box/job/maven-surefire/job/master/60/consoleFull

Therefore the entire job failed: the difference between two timestamps was not positive, so a test failed.
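
To illustrate the kind of failure described above, here is a minimal sketch of measuring elapsed time in Java. It is an assumption that the failing test compared wall-clock timestamps; the log does not say which clock it used. The class and method names (ElapsedTimeSketch, elapsedNanos) are hypothetical and do not come from the Surefire code base. System.nanoTime() is documented as intended for elapsed-time measurement and is unrelated to wall-clock time, whereas System.currentTimeMillis() follows the system clock and can yield a zero or negative difference if that clock stalls or is adjusted. Whether nanoTime() would have kept advancing on this particular machine is not known.

```java
import java.util.concurrent.TimeUnit;

public class ElapsedTimeSketch {

    // Hypothetical helper: times a task with System.nanoTime(), which is
    // meant for measuring elapsed time and is not tied to the wall clock.
    public static long elapsedNanos(Runnable task) {
        long start = System.nanoTime();
        task.run();
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        // Wall-clock reading, shown only for comparison; it depends on the
        // system clock and may not advance if that clock stalls.
        long wallStart = System.currentTimeMillis();

        long nanos = elapsedNanos(() -> {
            try {
                Thread.sleep(20); // stand-in for the work under test
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        long wallDelta = System.currentTimeMillis() - wallStart;

        System.out.println("nanoTime elapsed (ms):  " + TimeUnit.NANOSECONDS.toMillis(nanos));
        System.out.println("wall-clock delta (ms):  " + wallDelta);
    }
}
```

A test that asserts on the nanoTime-based value does not depend on the wall clock, although it can still be affected by an overloaded machine.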
We have very frequent problems with the Windows machines.
They are heavily overloaded: each has two executors, but the machine has only one CPU.
Our performance tests fail.
Java with Parallel GC requires at least two CPUs.
Please increase the number of CPUs on the Windows machines.
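
To check what a JVM actually sees on a build machine, something like the following sketch could be run there. The class name AgentCapacitySketch is hypothetical; the calls are standard JDK APIs (Runtime.availableProcessors() and the GarbageCollectorMXBean list from java.lang.management). It only reports the CPU count and the active collectors; it does not prove which collector a given Surefire fork uses, since that depends on the JVM options of that fork.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.ArrayList;
import java.util.List;

public class AgentCapacitySketch {

    // Number of processors the JVM reports as available.
    public static int visibleCpus() {
        return Runtime.getRuntime().availableProcessors();
    }

    // Names of the garbage collectors active in this JVM.
    public static List<String> collectorNames() {
        List<String> names = new ArrayList<>();
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            names.add(gc.getName());
        }
        return names;
    }

    public static void main(String[] args) {
        System.out.println("CPUs visible to the JVM: " + visibleCpus());
        System.out.println("Active collectors: " + collectorNames());
    }
}
```

Running it as a first build step on each Windows machine would show whether the JVM really sees a single CPU.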

> Unstable Jenkins. Aborted build. No disk space. Lost connection between slave and master.
> -----------------------------------------------------------------------------------------
>
>                 Key: INFRA-16951
>                 URL: https://issues.apache.org/jira/browse/INFRA-16951
>             Project: Infrastructure
>          Issue Type: Bug
>          Components: Jenkins
>         Environment: Jenkins Windows/Linux slaves aborted build
>            Reporter: Tibor Digana
>            Assignee: Gavin
>            Priority: Major
>
> My build is still unstable because of h/w issues.
> Windows and Linux machines went down, and there was no permission to open a file (1GB of files).
> The worst case was the last build, where I lost the connection to the slaves for 5 minutes and it looks like somebody or something aborted my build. The h/w issue is an additional issue:
> java.nio.file.FileSystemException: /x1/jenkins/jenkins-home/jobs/maven-box/jobs/maven-surefire/branches/INV1561/builds/24/archive/surefire-its--windows-jdk7-maven3.5.x.zip: Too many open files
> Please see the full log:
> https://builds.apache.org/job/maven-box/job/maven-surefire/job/INV1561/24/consoleFull
> If you want to know which machines were used, please search for the string "Running on" in the console log.
> Can you find out who aborted my build?
> It usually takes less than two hours to complete and uses 8 executors.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
