apex-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Thomas Weise (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (APEXCORE-426) Support work preserving AM recovery
Date Sun, 10 Apr 2016 02:54:25 GMT

     [ https://issues.apache.org/jira/browse/APEXCORE-426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Thomas Weise updated APEXCORE-426:
    Summary: Support work preserving AM recovery  (was: Support work preserving AM restart)

> Support work preserving AM recovery
> -----------------------------------
>                 Key: APEXCORE-426
>                 URL: https://issues.apache.org/jira/browse/APEXCORE-426
>             Project: Apache Apex Core
>          Issue Type: Improvement
>            Reporter: Thomas Weise
> On app master failure, the streaming containers should continue running. 
> As of 2.2, YARN will automatically terminate all containers and the replacement app master
will relaunch them. Once we move to a newer minimum Hadoop version, we should leverage work
preserving restart.
> The mechanism in Apex containers to locate the new master process are already in place.

This message was sent by Atlassian JIRA

View raw message