hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "zhihai xu (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (YARN-6396) Call verifyAndCreateRemoteLogDir at service initialization instead of application initialization to decrease load for name node
Date Fri, 14 Apr 2017 16:03:42 GMT

    [ https://issues.apache.org/jira/browse/YARN-6396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15969197#comment-15969197
] 

zhihai xu edited comment on YARN-6396 at 4/14/17 4:02 PM:
----------------------------------------------------------

Thanks for the review [~jianhe] and [~rkanter]! if some one deletes the remote log dir, all
the old log will disappear. That will be a more serious issue, recreating the remote log dir
won't save the old log data. This looks like a monitor problem, I think it will be better
to do it in some tool outside the NM. It will be more efficient to do it at one place instead
of on each NM, which could be many thousands in a large cluster. Yes, it's a trade off between
validation and efficiency. Also restarting the NM will help recreate the remote log dir.


was (Author: zxu):
Thanks for the review [~jianhe] and [~rkanter]! if some one deletes the remote log dir, all
the old log will disappear. That will be a more serious issue, recreating the remote log dir
won't save the old log data. This looks like a monitor problem, I think it will be better
to do it in some tool outside the NM. It will be more efficient to do it at one place instead
of on each NM, which could be many thousands in a large cluster. Yes, it's a trade off between
validation and efficiency.

> Call verifyAndCreateRemoteLogDir at service initialization instead of application initialization
to decrease load for name node
> -------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: YARN-6396
>                 URL: https://issues.apache.org/jira/browse/YARN-6396
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: log-aggregation
>    Affects Versions: 3.0.0-alpha2
>            Reporter: zhihai xu
>            Assignee: zhihai xu
>            Priority: Minor
>         Attachments: YARN-6396.000.patch
>
>
> Call verifyAndCreateRemoteLogDir at service initialization instead of application initialization
to decrease load for name node.
> Currently for every application at each Node, verifyAndCreateRemoteLogDir will be called
before doing log aggregation, This will be a non trivial overhead for name node in a large
cluster since verifyAndCreateRemoteLogDir calls getFileStatus. Once the remote log directory
is created successfully, it is not necessary to call it again. It will be better to call verifyAndCreateRemoteLogDir
at LogAggregationService service initialization.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message