edgent-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dale LaBossiere (JIRA)" <j...@apache.org>
Subject [jira] [Created] (EDGENT-396) JobMonitorApp restarts job 3 times more than it should
Date Tue, 14 Mar 2017 19:02:41 GMT
Dale LaBossiere created EDGENT-396:
--------------------------------------

             Summary: JobMonitorApp restarts job 3 times more than it should
                 Key: EDGENT-396
                 URL: https://issues.apache.org/jira/browse/EDGENT-396
             Project: Edgent
          Issue Type: Bug
          Components: Runtime
            Reporter: Dale LaBossiere
            Assignee: Dale LaBossiere


N.B. I don't think this bug affects IotProvider since I'm pretty sure that while IotProvider
includes JobMonitorApp, it doesn't register a JobRegistryService so the monitor does nothing.
 JIRA forthcoming for that.

JobMonitorAppTest exercises the app but it doesn't perform any validation that restarts were
actually happening.  Adding instrumentation / validation highlights that 3x the number of
rebuilds/restarts are happening.

{code}
appOne: buildCnt: 7 injectedFailureCnt: 2
appTwo: buildCnt: 10 injectedFailureCnt: 3
{code}

Further investigation identifies the JobMonitorApp's job event filtering as the problem. 
Each "failed" job ends up with 3 events that pass through the filter
{code}
RUNNING, RUNNING, UNHEALTHY
RUNNING, CLOSED, UNHEALTHY
CLOSED, CLOSED, UNHEALTHY
{code}
... or something like that



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message