hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Payne (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3074) Nodemanager dies when localizer runner tries to write to a full disk
Date Tue, 03 Feb 2015 14:48:35 GMT

    [ https://issues.apache.org/jira/browse/YARN-3074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14303362#comment-14303362

Eric Payne commented on YARN-3074:

Thanks for your reply, [~varun_saxena],
bq.  I actually wanted a different message to be printed in logs for these exceptions. I can
probably write a small method to consolidate these duplicate lines.
If you want to include separate messages, then separate sections is probably fine. However,
the "cause" message in the exception/FSError will give more information, so separate message
is probably not necessary. If you do choose to have separate catch blocks, the message for
FSError should probably say "filesystem error" rather than "disk error." The FSError may be
related to the filey system, but not necessary a problem with the disk. Also, I would not
add the overhead of a separate method just to save a couple of lines of code.

> Nodemanager dies when localizer runner tries to write to a full disk
> --------------------------------------------------------------------
>                 Key: YARN-3074
>                 URL: https://issues.apache.org/jira/browse/YARN-3074
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.5.0
>            Reporter: Jason Lowe
>            Assignee: Varun Saxena
>         Attachments: YARN-3074.001.patch
> When a LocalizerRunner tries to write to a full disk it can bring down the nodemanager
process.  Instead of failing the whole process we should fail only the container and make
a best attempt to keep going.

This message was sent by Atlassian JIRA

View raw message