aurora-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "brian wickman (JIRA)" <j...@apache.org>
Subject [jira] [Created] (AURORA-1353) MesosPathDetector double-counts checkpoint roots
Date Wed, 17 Jun 2015 20:51:03 GMT
brian wickman created AURORA-1353:
-------------------------------------

             Summary: MesosPathDetector double-counts checkpoint roots
                 Key: AURORA-1353
                 URL: https://issues.apache.org/jira/browse/AURORA-1353
             Project: Aurora
          Issue Type: Bug
          Components: Observer
            Reporter: brian wickman


This manifests itself in the observer ballooning in resource usage.  The TaskDetector finds
two copies of every task because of the 'latest' symlink being a duplicate of the actual directory,
so we get race conditions in on_active/on_finished inside src/main/python/apache/thermos/observer/task_observer.py
that cause it to hold onto TaskResourceMonitors which tend to be pretty expensive to run.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message