hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xuan Gong (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2583) Modify the LogDeletionService to support Log aggregation for LRS
Date Tue, 07 Oct 2014 17:29:34 GMT

    [ https://issues.apache.org/jira/browse/YARN-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14162157#comment-14162157

Xuan Gong commented on YARN-2583:

Here is the proposal:
* Add private configuration for number of logs we can save in NM side. We will delete old
logs if the num of logs is larger than this configured value. This is a temporary solution.
The configuration will be deleted once we find a more scalable method(will be tracked by YARN-2548)
to only write a single log file per LRS. 
* jhs contacts RM to check whether app is still running or not. If this app is still running,
we need to keep the app dir, but remove the old logs.
* Remove per-app LogRollingInterval completely and then have NM wake up every so often and
upload log files. In this ticket, we can spin off LogRollingInterval from AppLogAggregatorImpl.
YARN-2651 will be used to track the changes for other places.
* Enforce the minimal log rolling interval. (3600 seconds will be used as minimal value)

> Modify the LogDeletionService to support Log aggregation for LRS
> ----------------------------------------------------------------
>                 Key: YARN-2583
>                 URL: https://issues.apache.org/jira/browse/YARN-2583
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager, resourcemanager
>            Reporter: Xuan Gong
>            Assignee: Xuan Gong
>         Attachments: YARN-2583.1.patch, YARN-2583.2.patch, YARN-2583.3.1.patch, YARN-2583.3.patch
> Currently, AggregatedLogDeletionService will delete old logs from HDFS. It will check
the cut-off-time, if all logs for this application is older than this cut-off-time. The app-log-dir
from HDFS will be deleted. This will not work for LRS. We expect a LRS application can keep
running for a long time. 
> Two different scenarios: 
> 1) If we configured the rollingIntervalSeconds, the new log file will be always uploaded
to HDFS. The number of log files for this application will become larger and larger. And there
is no log files will be deleted.
> 2) If we did not configure the rollingIntervalSeconds, the log file can only be uploaded
to HDFS after the application is finished. It is very possible that the logs are uploaded
after the cut-off-time. It will cause problem because at that time the app-log-dir for this
application in HDFS has been deleted.

This message was sent by Atlassian JIRA

View raw message