hadoop-yarn-issues mailing list archives

From "Hitesh Shah (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-778) Failures in container launches due to issues like disk failure are difficult to diagnose
Date Thu, 06 Jun 2013 23:00:20 GMT

    [ https://issues.apache.org/jira/browse/YARN-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13677632#comment-13677632 ]

Hitesh Shah commented on YARN-778:
----------------------------------

Here is a case where a container launch failed due to a bad disk. The cause is hard to
determine without inspecting both the NM logs and the disk in question.

2013-06-06 17:45:25,329 WARN  launcher.ContainerLaunch (ContainerLaunch.java:call(255)) - Failed to launch container.
java.io.IOException: mkdir of /grid/1/hdp/yarn/local/usercache/hrt_qa/appcache/application_1370473246485_0136/container_1370473246485_0136_01_000019 failed
        at org.apache.hadoop.fs.FileSystem.primitiveMkdir(FileSystem.java:1044)
        at org.apache.hadoop.fs.DelegateToFileSystem.mkdir(DelegateToFileSystem.java:150)
        at org.apache.hadoop.fs.FilterFs.mkdir(FilterFs.java:187)
        at org.apache.hadoop.fs.FileContext$4.next(FileContext.java:730)
        at org.apache.hadoop.fs.FileContext$4.next(FileContext.java:726)
        at org.apache.hadoop.fs.FileContext$FSLinkResolver.resolve(FileContext.java:2379)
        at org.apache.hadoop.fs.FileContext.mkdir(FileContext.java:726)
        at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.createDir(DefaultContainerExecutor.java:412)
        at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:130)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:250)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:73)
        at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
        at
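
The manual diagnosis described above (find the failing path in the NM log, then check the disk) could be sketched roughly as follows. This is only an illustration, not anything that exists in YARN: the function names are hypothetical, the regular expression assumes the "mkdir of <path> failed" message format seen in this log, and the write probe is a crude stand-in for a real disk health check.

```python
import os
import re
import tempfile

# Assumption: the failure appears as "java.io.IOException: mkdir of <path> failed",
# as in the log above. The archive may wrap the line, so any whitespace
# (including a newline) is accepted before "failed".
MKDIR_FAILURE = re.compile(r"java\.io\.IOException: mkdir of (\S+)\s+failed")


def failed_mkdir_paths(log_text):
    """Return every path whose mkdir failure is reported in the log text."""
    return MKDIR_FAILURE.findall(log_text)


def nearest_existing_ancestor(path):
    """Walk up from path until a directory that actually exists is found."""
    current = os.path.abspath(path)
    while not os.path.isdir(current):
        parent = os.path.dirname(current)
        if parent == current:  # reached the filesystem root
            break
        current = parent
    return current


def probe_writable(directory):
    """Create and remove a scratch directory; return None on success, else the error."""
    try:
        scratch = tempfile.mkdtemp(prefix="probe_", dir=directory)
        os.rmdir(scratch)
        return None
    except OSError as exc:
        return f"{type(exc).__name__}: {exc}"
```

To use it on a node, one would pass the NM log contents to failed_mkdir_paths, call nearest_existing_ancestor on each reported path, and run probe_writable on the result as the user the NodeManager runs as. An error such as a read-only filesystem or I/O error points at the disk rather than at the application.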
                
> Failures in container launches due to issues like disk failure are difficult to diagnose
> ----------------------------------------------------------------------------------------
>
>                 Key: YARN-778
>                 URL: https://issues.apache.org/jira/browse/YARN-778
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Hitesh Shah
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
