hadoop-yarn-issues mailing list archives

From "Eric Yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-8587) Delays are noticed to launch docker container
Date Sun, 21 Oct 2018 17:22:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-8587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16658296#comment-16658296 ]

Eric Yang commented on YARN-8587:
---------------------------------

[~Charo Zhang] Thank you for the patch.  max_retries is hard-coded to 3 instead of being read with get_max_retries(&CFG).
Is there a reason for the retry count to be 3?
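
A minimal sketch of what reading the limit from configuration could look like. The struct, the accessor, the fallback value, and the probe callback below are hypothetical stand-ins for illustration; they are not the actual container-executor code, and the real signature of get_max_retries may differ.

```c
#include <stddef.h>

/* Hypothetical stand-in for the executor configuration (assumption). */
struct sketch_config {
  int max_retries;
};

/* Hypothetical accessor modeled on the get_max_retries(&CFG) call named above.
 * Falls back to an arbitrary default when the value is unset or invalid. */
int sketch_get_max_retries(const struct sketch_config *cfg) {
  const int fallback_retries = 1; /* arbitrary value chosen for this sketch */
  if (cfg == NULL || cfg->max_retries <= 0) {
    return fallback_retries;
  }
  return cfg->max_retries;
}

/* Example probe: fails until the counter pointed to by arg reaches zero. */
int sketch_probe_countdown(void *arg) {
  int *remaining_failures = arg;
  if (*remaining_failures > 0) {
    (*remaining_failures)--;
    return -1;
  }
  return 0;
}

/* Calls probe() until it returns 0 or the configured limit is reached.
 * Returns the number of attempts used on success, or -1 if every attempt failed. */
int sketch_retry(const struct sketch_config *cfg,
                 int (*probe)(void *), void *arg) {
  int max_retries = sketch_get_max_retries(cfg);
  int attempt;
  for (attempt = 1; attempt <= max_retries; attempt++) {
    if (probe(arg) == 0) {
      return attempt;
    }
  }
  return -1;
}
```

With this shape, the loop bound follows whatever the configuration provides, so an operator can tune it without rebuilding the binary.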

Indentation and spacing are not properly aligned.  Hadoop uses 2 spaces for indentation.
{code}
if (pclose (inspect_exitcode_docker) != 0 || res <= 0) {
} else {
}
{code}

The rest of the patch looks good to me.

> Delays are noticed to launch docker container
> ---------------------------------------------
>
>                 Key: YARN-8587
>                 URL: https://issues.apache.org/jira/browse/YARN-8587
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 3.1.1
>            Reporter: Yesha Vora
>            Assignee: Charo Zhang
>            Priority: Major
>              Labels: Docker
>             Fix For: 3.3.0
>
>         Attachments: YARN-8587.patch
>
>
> Launch the dshell application. Wait for the application to reach the RUNNING state.
> {code:java}
> yarn jar /xx/hadoop-yarn-applications-distributedshell-*.jar -shell_command "sleep 300" -num_containers 1 -shell_env YARN_CONTAINER_RUNTIME_TYPE=docker -shell_env YARN_CONTAINER_RUNTIME_DOCKER_IMAGE=httpd:0.1 -shell_env YARN_CONTAINER_RUNTIME_DOCKER_DELAYED_REMOVAL=true -jar /usr/hdp/current/hadoop-yarn-client/hadoop-yarn-applications-distributedshell-xx.jar
> {code}
> Find the container allocation. Run the docker inspect command for the docker containers launched by the app.
> Sometimes the container is allocated to the NM, but the docker PID is not up.
> {code:java}
> Command ssh -q -o StrictHostKeyChecking=no -o UserKnownHostsFile=/dev/null xxx "sudo su - -c \"docker ps -a | grep container_e02_1531189225093_0003_01_000002\" root" failed after 0 retries
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


