aurora-issues mailing list archives

From "Chris Lambert (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (AURORA-722) snapshot performance issues
Date Mon, 06 Oct 2014 16:58:36 GMT

     [ https://issues.apache.org/jira/browse/AURORA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris Lambert updated AURORA-722:
---------------------------------
    Sprint: Aurora Q3 Sprint 2, Aurora Q3 Sprint 3, Aurora Q4 Sprint 1 (was: Aurora Q3 Sprint 2, Aurora Q3 Sprint 3)

> snapshot performance issues
> ---------------------------
>
>                 Key: AURORA-722
>                 URL: https://issues.apache.org/jira/browse/AURORA-722
>             Project: Aurora
>          Issue Type: Bug
>          Components: Scheduler
>            Reporter: Kevin Sweeney
>            Assignee: Kevin Sweeney
>             Fix For: 0.6.0
>
>
> In one of our larger production clusters we're seeing issues with snapshot performance that cause the scheduler to fail over before completing a snapshot.
> For background, the scheduler writes a compressed (when -deflate_snapshots is enabled), binary-encoded Snapshot (from api.thrift) to the Mesos replicated log every hour (or at the interval set by -dlog_snapshot_interval). This snapshot represents most of the scheduler's heap usage, including the configuration for all tasks running in the cluster.
> Add appropriate instrumentation to the snapshot routine and patch any obvious performance bottlenecks.
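
To illustrate the kind of instrumentation the description asks for, here is a minimal Python sketch that times the serialize and deflate stages of a snapshot separately. It is not the Aurora scheduler's code (the scheduler is written in Java), and every name in it (timed, take_snapshot, timings, the task strings) is hypothetical. The newline-joined payload is only a stand-in for the Thrift binary encoding, and zlib stands in for whatever -deflate_snapshots uses internally.

```python
import time
import zlib
from contextlib import contextmanager

# Hypothetical accumulator: total seconds spent in each named stage.
timings = {}


@contextmanager
def timed(stage):
    """Add the wall-clock time of the enclosed block to timings[stage]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed = time.perf_counter() - start
        timings[stage] = timings.get(stage, 0.0) + elapsed


def take_snapshot(tasks, deflate=True):
    """Encode a stand-in snapshot and optionally compress it, timing each stage."""
    with timed("serialize"):
        # Stand-in for binary encoding of the real Snapshot struct.
        payload = "\n".join(tasks).encode("utf-8")
    if deflate:
        with timed("deflate"):
            payload = zlib.compress(payload)
    return payload


# Made-up task configurations, repetitive enough to compress well.
tasks = [f"job-{i % 10}/instance-{i}" for i in range(10000)]
compressed = take_snapshot(tasks)
raw = take_snapshot(tasks, deflate=False)

for stage, seconds in sorted(timings.items()):
    print(f"{stage}: {seconds * 1000:.2f} ms")
print(f"raw={len(raw)} bytes, compressed={len(compressed)} bytes")
```

Recording each stage under its own name makes it possible to tell whether encoding, compression, or the log write dominates before deciding what to patch.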



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
