aurora-issues mailing list archives

From "Stephan Erb (JIRA)" <>
Subject [jira] [Commented] (AURORA-1939) Thermos landing (host) page reports incorrect CPU rates when it is busy
Date Sun, 23 Jul 2017 20:53:00 GMT


Stephan Erb commented on AURORA-1939:

This is now on master. Thanks for the patch!

commit cdc5b8efd5bb86d38f73cca6d91903078b120333
Author: Reza Motamedi
Date:   Sat Jul 22 20:28:50 2017 +0200

Remove psutil's oneshot

After a lot of testing on busy machines, I realized that psutil's oneshot is
not thread-safe. I contacted the developer but have not received a concrete
response.

Please read and for more information.

These inconsistencies disappear after removing oneshot.

Reviewed at

src/main/python/apache/thermos/monitoring/ | 23 +++++++++++------------
 1 file changed, 11 insertions(+), 12 deletions(-)

> Thermos landing (host) page reports incorrect CPU rates when it is busy
> -----------------------------------------------------------------------
>                 Key: AURORA-1939
>                 URL:
>             Project: Aurora
>          Issue Type: Bug
>            Reporter: Reza Motamedi
>            Assignee: Reza Motamedi
>            Priority: Minor
> Thermos Observer uses `psutil` to monitor resource consumption of Thermos Processes.
> On a busy machine, I have noticed negative CPU values when visiting the Thermos landing page.
> In my test I reproduced this by starting many processes that constantly create short-lived
> children. This indicates that, in the time between `process_collector_psutil` looking up the
> Process children and calculating the CPU time, the pid of a child is actually
> reused by another, much younger process, which leads to negative CPU times.
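
The pid-reuse effect described in the issue can be illustrated with a minimal, stdlib-only sketch. It is not the Thermos implementation and does not reflect the committed fix (which removed psutil's oneshot). The `Sample` type and both functions are hypothetical names introduced for illustration. The sketch assumes each reading carries the process start time (psutil exposes one through `Process.create_time()`), so a recycled pid can be told apart from the original process.

```python
from collections import namedtuple

# Hypothetical sample type: one CPU reading for one process.
Sample = namedtuple("Sample", ["pid", "create_time", "cpu_seconds", "taken_at"])


def naive_cpu_rates(previous, current):
    """Match samples on pid only; a reused pid can yield a negative rate."""
    before = {s.pid: s for s in previous}
    rates = {}
    for sample in current:
        old = before.get(sample.pid)
        if old is None:
            continue  # no baseline for this pid yet
        elapsed = sample.taken_at - old.taken_at
        if elapsed <= 0:
            continue  # snapshots out of order or taken at the same instant
        rates[sample.pid] = (sample.cpu_seconds - old.cpu_seconds) / elapsed
    return rates


def guarded_cpu_rates(previous, current):
    """Match samples on (pid, create_time) so a recycled pid has no baseline."""
    before = {(s.pid, s.create_time): s for s in previous}
    rates = {}
    for sample in current:
        key = (sample.pid, sample.create_time)
        old = before.get(key)
        if old is None:
            continue  # new process or reused pid: skip until a baseline exists
        elapsed = sample.taken_at - old.taken_at
        if elapsed <= 0:
            continue  # snapshots out of order or taken at the same instant
        rates[key] = (sample.cpu_seconds - old.cpu_seconds) / elapsed
    return rates


# pid 100 exits and its pid is reused by a much younger process between snapshots.
previous = [Sample(100, 1000.0, 50.0, 10.0), Sample(101, 1001.0, 2.0, 10.0)]
current = [Sample(100, 1009.5, 0.5, 20.0), Sample(101, 1001.0, 4.0, 20.0)]

naive = naive_cpu_rates(previous, current)
guarded = guarded_cpu_rates(previous, current)
```

With these made-up readings, the pid-only version reports a negative rate for pid 100, because the younger process has accumulated less CPU time than its predecessor. The guarded version simply has no entry for the reused pid until a second sample of the new process is available.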

This message was sent by Atlassian JIRA
