mesos-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Megha (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MESOS-6223) Allow agents to re-register post a host reboot
Date Wed, 21 Sep 2016 23:08:21 GMT

     [ https://issues.apache.org/jira/browse/MESOS-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Megha updated MESOS-6223:
-------------------------
    Description: Agent does’t recover its state post a host reboot, it registers with the
master and gets a new SlaveID. With partition awareness, the agents are now allowed to re-register
after they have been marked Unreachable. The executors are anyway terminated on the agent
when it reboots so there is no harm in letting the agent keep its SlaveID, re-register with
the master and reconcile the lost executors. This is a pre-requisite for supporting persistent/restartable
tasks in mesos (MESOS-3545).  (was: Agent does’t recover its state post a host reboot, it
registers with the master and gets a new SlaveID. With partition awareness, the agents are
now allowed to re-register after they have been marked Unreachable. The executors are anyway
terminated on the agent when it reboots so there is no harm in letting the agent keep its
SlaveID, re-register with the master and reconcile the lost executors. This is a pre-requisite
for supporting persistent/restartable tasks in mesos (https://issues.apache.org/jira/browse/MESOS-3545).)

> Allow agents to re-register post a host reboot
> ----------------------------------------------
>
>                 Key: MESOS-6223
>                 URL: https://issues.apache.org/jira/browse/MESOS-6223
>             Project: Mesos
>          Issue Type: Improvement
>          Components: slave
>            Reporter: Megha
>
> Agent does’t recover its state post a host reboot, it registers with the master and
gets a new SlaveID. With partition awareness, the agents are now allowed to re-register after
they have been marked Unreachable. The executors are anyway terminated on the agent when it
reboots so there is no harm in letting the agent keep its SlaveID, re-register with the master
and reconcile the lost executors. This is a pre-requisite for supporting persistent/restartable
tasks in mesos (MESOS-3545).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message