mesos-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gastón Kleiman (JIRA) <j...@apache.org>
Subject [jira] [Created] (MESOS-9318) Consider providing better operation status updates while an RP is recovering
Date Mon, 15 Oct 2018 20:39:00 GMT
Gastón Kleiman created MESOS-9318:
-------------------------------------

             Summary: Consider providing better operation status updates while an RP is recovering
                 Key: MESOS-9318
                 URL: https://issues.apache.org/jira/browse/MESOS-9318
             Project: Mesos
          Issue Type: Task
    Affects Versions: 1.7.0, 1.6.0
            Reporter: Gastón Kleiman


Consider the following scenario:

1. A framework accepts an offer with an operation affecting SLRP resources.
2. The master forwards it to the corresponding agent.
3. The agent forwards it to the corresponding RP.
4. The agent and the master fail over.
5. The master recovers.
6. The agent recovers while the RP is still recovering, so it doesn't include the pending
operation on the {{RegisterMessage}}.
7. A framework performs an explicit operation status reconciliation.

In this case the master will currently respond with {{OPERATION_UNKNOWN}}, but it should be
possible to respond with a more fine-grained and useful state, such as {{OPERATION_RECOVERING}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message