asterixdb-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Murtadha Hubail (JIRA)" <>
Subject [jira] [Resolved] (ASTERIXDB-2284) Ensure Node Failure on Heartbeat Misses
Date Fri, 16 Feb 2018 04:36:00 GMT


Murtadha Hubail resolved ASTERIXDB-2284.
    Resolution: Implemented

> Ensure Node Failure on Heartbeat Misses
> ---------------------------------------
>                 Key: ASTERIXDB-2284
>                 URL:
>             Project: Apache AsterixDB
>          Issue Type: Improvement
>            Reporter: Murtadha Hubail
>            Assignee: Murtadha Hubail
>            Priority: Major
> Currently, there is a possibility that an NC exceeds the allowed period to send its heartbeat
(i.e. due to garbage collection pause), and continue to stay up which will result in the cluster
state being unusable forever. The proposal is to ensure the failed node has really failed
by asking it to shutdown. By doing this, if the shutdown succeeds, the NC will be restarted
and the cluster state will be active again when the NC joins.

This message was sent by Atlassian JIRA

View raw message