Return-Path: X-Original-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0EAEC11282 for ; Tue, 20 May 2014 14:12:39 +0000 (UTC) Received: (qmail 45031 invoked by uid 500); 20 May 2014 14:12:38 -0000 Delivered-To: apmail-hadoop-yarn-issues-archive@hadoop.apache.org Received: (qmail 44991 invoked by uid 500); 20 May 2014 14:12:38 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-issues@hadoop.apache.org Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 44982 invoked by uid 99); 20 May 2014 14:12:38 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 May 2014 14:12:38 +0000 Date: Tue, 20 May 2014 14:12:38 +0000 (UTC) From: "Jason Lowe (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-1352) Recover LogAggregationService upon nodemanager restart MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/YARN-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14003325#comment-14003325 ] Jason Lowe commented on YARN-1352: ---------------------------------- Ah, good point. I'll file a separate JIRA to handle saving that state. As for this JIRA, I believe it will be covered by the combination of YARN-1354 and YARN-1337. As applications are recovered it will notify the LogAggregationService, and as active and completed containers are recovered it will also notify the aggregation service. If the application did not finish aggregating then it will attempt to aggregate the app's container logs again. > Recover LogAggregationService upon nodemanager restart > ------------------------------------------------------ > > Key: YARN-1352 > URL: https://issues.apache.org/jira/browse/YARN-1352 > Project: Hadoop YARN > Issue Type: Sub-task > Components: nodemanager > Affects Versions: 2.3.0 > Reporter: Jason Lowe > Assignee: Jason Lowe > > LogAggregationService state needs to be recovered as part of the work-preserving nodemanager restart feature. -- This message was sent by Atlassian JIRA (v6.2#6252)