ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alexey Goncharuk (JIRA)" <j...@apache.org>
Subject [jira] [Created] (IGNITE-6587) Ignite watchdog service
Date Tue, 10 Oct 2017 11:30:00 GMT
Alexey Goncharuk created IGNITE-6587:
----------------------------------------

             Summary: Ignite watchdog service
                 Key: IGNITE-6587
                 URL: https://issues.apache.org/jira/browse/IGNITE-6587
             Project: Ignite
          Issue Type: Improvement
          Components: general
    Affects Versions: 2.2
            Reporter: Alexey Goncharuk
             Fix For: 2.4


We need to come up with a 'watchdog service' to monitor for Ignite node local health and kill
the process under some critical conditions.
For example, if one of the mission-critical Ignite threads die, the Ignite node must be stopped.
At the first glance, the list of critical threads is:
All TCP discovery threads
All communication NIO threads (acceptor and workers)
Exchange worker
Striped pool threads
Timeout Worker
Checkpointer 
WAL archiver

The mechanism should support pluggable components so that self-check can be extended via plugins.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message