hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karthik Kambatla (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2194) Cgroups cease to work in RHEL7
Date Fri, 12 Jun 2015 06:42:02 GMT

    [ https://issues.apache.org/jira/browse/YARN-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14583045#comment-14583045

Karthik Kambatla commented on YARN-2194:

I tried running jobs with the patch posted here, and ran into issues during localization:
Localizer failed
java.io.IOException: Application application_1434091083696_0001 initialization failed (exitCode=20)
with output: main : command provided 0
main : user is nobody
main : requested yarn user is systest
Failed to create directory /data/yarn/nm%/data1/yarn/nm/usercache/systest - No such file or

	at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:241)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1132)
Caused by: ExitCodeException exitCode=20: 
	at org.apache.hadoop.util.Shell.runCommand(Shell.java:538)
	at org.apache.hadoop.util.Shell.run(Shell.java:455)
	at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:715)
	at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:232)
	... 1 more

> Cgroups cease to work in RHEL7
> ------------------------------
>                 Key: YARN-2194
>                 URL: https://issues.apache.org/jira/browse/YARN-2194
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.7.0
>            Reporter: Wei Yan
>            Assignee: Wei Yan
>            Priority: Critical
>         Attachments: YARN-2194-1.patch, YARN-2194-2.patch, YARN-2194-3.patch, YARN-2194-4.patch
> In RHEL7, the CPU controller is named "cpu,cpuacct". The comma in the controller name
leads to container launch failure. 
> RHEL7 deprecates libcgroup and recommends the user of systemd. However, systemd has certain
shortcomings as identified in this JIRA (see comments). 
> This JIRA only fixes the failure, and doesn't try to use systemd.

This message was sent by Atlassian JIRA

View raw message