mesos-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhitao Li (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (MESOS-6177) Return unregistered agents recovered from registrar in `GetAgents` and/or `/state.json`
Date Fri, 14 Oct 2016 01:24:21 GMT

    [ https://issues.apache.org/jira/browse/MESOS-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15572997#comment-15572997
] 

Zhitao Li edited comment on MESOS-6177 at 10/14/16 1:24 AM:
------------------------------------------------------------


(edited)

[~anandmazumdar], after some more thoughts, I'm inclined to return the full {{AgentInfo}}
instead of only {{AgentID}} for agents in {{recovered}} state.

This has the benefit to help operators to know the hostname of the agent id which is not recovered
yet without calling registry again.

-My primary intention is to have a hold of {{pid}}, so the operator/subscriber can know the
ip:port the agent is listening at. If we only return {{AgentID}}, the operator can do little
additional babysitting steps to validate the state of the agent, except for waiting for {{--agent_reregistration_timeout}}
to pass.-

-This is also pretty easy to implement IIUIC: we can simply change the {{slaves.recovered}}
from {{hashset<SlaveID>}} to {{hashmap<SlaveID, SlaveInfo>}}. The {{SlaveInfo}}
is already available after Registrar recovers it.-



was (Author: zhitao):
[~anandmazumdar], after some more thoughts, I'm inclined to return the full {{AgentInfo}}
instead of only {{AgentID}} for agents in {{recovered}} state.

My primary intention is to have a hold of {{pid}}, so the operator/subscriber can know the
ip:port the agent is listening at. If we only return {{AgentID}}, the operator can do little
additional babysitting steps to validate the state of the agent, except for waiting for {{--agent_reregistration_timeout}}
to pass.

This is also pretty easy to implement IIUIC: we can simply change the {{slaves.recovered}}
from {{hashset<SlaveID>}} to {{hashmap<SlaveID, SlaveInfo>}}. The {{SlaveInfo}}
is already available after Registrar recovers it.

> Return unregistered agents recovered from registrar in `GetAgents` and/or `/state.json`
> ---------------------------------------------------------------------------------------
>
>                 Key: MESOS-6177
>                 URL: https://issues.apache.org/jira/browse/MESOS-6177
>             Project: Mesos
>          Issue Type: Improvement
>          Components: HTTP API
>            Reporter: Zhitao Li
>            Assignee: Zhitao Li
>
> Use case:
> This can be used for any software which talks to Mesos master to better understand state
of an unregistered agent after a master failover.
> If this information is available, the use case in MESOS-6174 can be handled with a simpler
decision of whether the corresponding agent is removed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message