mesos-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Adam B (JIRA)" <>
Subject [jira] [Commented] (MESOS-982) Relax slave (re-)registration retries and add a backoff mechanism.
Date Mon, 07 Apr 2014 20:59:16 GMT


Adam B commented on MESOS-982:

Shouldn't we still be concerned about the network load on the master, especially for master
failover in a large-scale cluster with thousands of slaves? Or in high-latency networks?
Admittedly, it's not a blocker for the registrar, but we might still want to add some retry/backoff

> Relax slave (re-)registration retries and add a backoff mechanism.
> ------------------------------------------------------------------
>                 Key: MESOS-982
>                 URL:
>             Project: Mesos
>          Issue Type: Sub-task
>          Components: slave
>            Reporter: Benjamin Mahler
>            Assignee: Vinod Kone
>             Fix For: 0.19.0
> With the Registrar in place, the master must persist the registration attempt of slaves.
> Slaves will currently retry registration every 1 second, until registration succeeds.
With the addition of the persistence in the master, we should relax this retry time and add
a back-off mechanism to avoid placing excessive load on the master.

This message was sent by Atlassian JIRA

View raw message