mesos-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhitao Li (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (MESOS-8609) Create a metric to indicate how long agent takes to recover executors
Date Wed, 14 Mar 2018 20:28:00 GMT

    [ https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16399215#comment-16399215
] 

Zhitao Li edited comment on MESOS-8609 at 3/14/18 8:27 PM:
-----------------------------------------------------------

{noformat}
commit 82c50c0e00284c131354499f74176b19d89bd21d (HEAD -> master, origin/master, origin/HEAD)
Author: Zhitao Li <zhitaoli.cs@gmail.com>
Date:   Wed Mar 14 09:25:01 2018 -0700

    Document new `slave/recovery_time_secs` gauge.
    
    Review: https://reviews.apache.org/r/66070

commit b8526c61403214aaa67fa941b4e8b0fd8e3328f2
Author: Zhitao Li <zhitaoli.cs@gmail.com>
Date:   Wed Mar 7 15:18:53 2018 -0800

    Added a test to make sure `slave/recovery_time_secs` is reported.
    
    Review: https://reviews.apache.org/r/65959

commit 026dafd33cd23d41818e18e31ec271fa2c13abd2
Author: Zhitao Li <zhitaoli.cs@gmail.com>
Date:   Tue Mar 6 17:43:48 2018 -0800

    Added a gauge for how long agent recovery takes.
    
    The new metric `slave/recover_time_secs` can be used to tell us how long
    Mesos agent needed to finish its recovery cycle. This is an important
    metric on agent machines which have a lot of completed executor
    sandboxes.
    
    Note that the metric 1) will only be available after recovery succeeded
    and 2) never change its value across agent process lifecycle afterwards.
    
    Review: https://reviews.apache.org/r/65954
{noformat}


was (Author: zhitao):
commit 82c50c0e00284c131354499f74176b19d89bd21d (HEAD -> master, origin/master, origin/HEAD)
Author: Zhitao Li <zhitaoli.cs@gmail.com>
Date:   Wed Mar 14 09:25:01 2018 -0700

    Document new `slave/recovery_time_secs` gauge.
    
    Review: https://reviews.apache.org/r/66070

commit b8526c61403214aaa67fa941b4e8b0fd8e3328f2
Author: Zhitao Li <zhitaoli.cs@gmail.com>
Date:   Wed Mar 7 15:18:53 2018 -0800

    Added a test to make sure `slave/recovery_time_secs` is reported.
    
    Review: https://reviews.apache.org/r/65959

commit 026dafd33cd23d41818e18e31ec271fa2c13abd2
Author: Zhitao Li <zhitaoli.cs@gmail.com>
Date:   Tue Mar 6 17:43:48 2018 -0800

    Added a gauge for how long agent recovery takes.
    
    The new metric `slave/recover_time_secs` can be used to tell us how long
    Mesos agent needed to finish its recovery cycle. This is an important
    metric on agent machines which have a lot of completed executor
    sandboxes.
    
    Note that the metric 1) will only be available after recovery succeeded
    and 2) never change its value across agent process lifecycle afterwards.
    
    Review: https://reviews.apache.org/r/65954


> Create a metric to indicate how long agent takes to recover executors
> ---------------------------------------------------------------------
>
>                 Key: MESOS-8609
>                 URL: https://issues.apache.org/jira/browse/MESOS-8609
>             Project: Mesos
>          Issue Type: Improvement
>          Components: agent
>            Reporter: Zhitao Li
>            Assignee: Zhitao Li
>            Priority: Minor
>              Labels: Metrics, agent
>             Fix For: 1.6.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message