hadoop-yarn-issues mailing list archives

From "Bikas Saha (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-257) NM should gracefully handle a full local disk
Date Wed, 05 Dec 2012 21:51:58 GMT

    [ https://issues.apache.org/jira/browse/YARN-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510812#comment-13510812 ]

Bikas Saha commented on YARN-257:
---------------------------------

Before the complete change, would it help if the NM stopped accepting new containers, perhaps
by indicating in its heartbeat that no containers should be assigned to it?
Also, why does the RM not notice the abnormal failure rate on such an NM and take it out of
rotation for scheduling?
                
> NM should gracefully handle a full local disk
> ---------------------------------------------
>
>                 Key: YARN-257
>                 URL: https://issues.apache.org/jira/browse/YARN-257
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: nodemanager
>    Affects Versions: 2.0.2-alpha, 0.23.5
>            Reporter: Jason Lowe
>
> When a local disk becomes full, the node will fail every container launched on it because
> the container is unable to localize.  The NM tries to create an app-specific directory in
> each of the local and log directories.  If any of those directory creations fails (due to
> lack of free space), the container fails.
> It would be nice if the node could continue to launch containers using the space available
> on other disks rather than failing all containers trying to launch on the node.
> This is somewhat related to YARN-91 but is centered around the disk becoming full rather
> than the disk failing.
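
To illustrate the behavior requested above, here is a minimal Python sketch of
skipping full disks instead of failing the whole container launch. It is not
NodeManager code: the function names, the `min_free_bytes` threshold, and the
per-app directory layout are all assumptions made for illustration.

```python
import os
import shutil


def dirs_with_free_space(candidate_dirs, min_free_bytes):
    """Return the candidate dirs whose filesystem has enough free space.

    Hypothetical helper: min_free_bytes is an assumed threshold, not a
    real YARN setting.
    """
    usable = []
    for d in candidate_dirs:
        try:
            free = shutil.disk_usage(d).free
        except OSError:
            # Missing or unreadable directory: treat it as unusable.
            continue
        if free >= min_free_bytes:
            usable.append(d)
    return usable


def create_app_dirs(candidate_dirs, app_id, min_free_bytes):
    """Create an app-specific directory on every disk that still has room.

    Fails only when no disk at all is usable, rather than when any
    single directory creation fails.
    """
    created = []
    for d in dirs_with_free_space(candidate_dirs, min_free_bytes):
        app_dir = os.path.join(d, app_id)
        try:
            os.makedirs(app_dir, exist_ok=True)
        except OSError:
            # The disk filled up (or failed) between the check and the
            # create; skip it and keep going with the remaining disks.
            continue
        created.append(app_dir)
    if not created:
        raise RuntimeError("no local directory has enough free space")
    return created
```

The free-space check is only advisory, since a disk can fill up between the
check and the directory creation, which is why the creation itself is also
allowed to fail per disk.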

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
