edgent-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dale LaBossiere (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (EDGENT-396) JobMonitorApp restarts job 3 times more than it should
Date Tue, 02 Jan 2018 19:08:00 GMT

     [ https://issues.apache.org/jira/browse/EDGENT-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Dale LaBossiere updated EDGENT-396:
-----------------------------------
    Fix Version/s: Apache Edgent 1.2.0

> JobMonitorApp restarts job 3 times more than it should
> ------------------------------------------------------
>
>                 Key: EDGENT-396
>                 URL: https://issues.apache.org/jira/browse/EDGENT-396
>             Project: Edgent
>          Issue Type: Bug
>          Components: Runtime
>            Reporter: Dale LaBossiere
>            Assignee: Dale LaBossiere
>             Fix For: Apache Edgent 1.2.0
>
>
> N.B. I don't think this bug affects IotProvider since I'm pretty sure that while IotProvider
includes JobMonitorApp, it doesn't register a JobRegistryService so the monitor does nothing.
 JIRA forthcoming for that.
> JobMonitorAppTest exercises the app but it doesn't perform any validation that restarts
were actually happening.  Adding instrumentation / validation highlights that 3x the number
of rebuilds/restarts are happening.
> {code}
> appOne: buildCnt: 7 injectedFailureCnt: 2
> appTwo: buildCnt: 10 injectedFailureCnt: 3
> {code}
> Further investigation identifies the JobMonitorApp's job event filtering as the problem.
 Each "failed" job ends up with 3 events that pass through the filter
> {code}
> RUNNING, RUNNING, UNHEALTHY
> RUNNING, CLOSED, UNHEALTHY
> CLOSED, CLOSED, UNHEALTHY
> {code}
> ... or something like that



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message