hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Haibo Chen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3760) Log aggregation failures
Date Wed, 29 Mar 2017 17:55:41 GMT

    [ https://issues.apache.org/jira/browse/YARN-3760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15947608#comment-15947608
] 

Haibo Chen commented on YARN-3760:
----------------------------------

bq. the ctor creates the fs data stream then a TFile.Writer w/o a try/catch. If the TFile.Writer
ctor throws an exception, it's impossible to close the stream.
YARN-6288 is in flight to fix this issue.
Will upload a patch to address the other issue that ISE causes FSDataStream leak.

> Log aggregation failures 
> -------------------------
>
>                 Key: YARN-3760
>                 URL: https://issues.apache.org/jira/browse/YARN-3760
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.4.0
>            Reporter: Daryn Sharp
>            Assignee: Haibo Chen
>            Priority: Critical
>
> The aggregated log file does not appear to be properly closed when writes fail.  This
leaves a lease renewer active in the NM that spams the NN with lease renewals.  If the token
is marked not to be cancelled, the renewals appear to continue until the token expires.  If
the token is cancelled, the periodic renew spam turns into a flood of failed connections until
the lease renewer gives up.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message