hadoop-yarn-issues mailing list archives

From "Eric Yang (Jira)" <j...@apache.org>
Subject [jira] [Commented] (YARN-9809) NMs should supply a health status when registering with RM
Date Wed, 17 Jun 2020 17:48:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-9809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17138685#comment-17138685 ]

Eric Yang commented on YARN-9809:
---------------------------------

[~ebadger] Thank you for the patch.  The patch looks very close to the final product.  I have
confirmed that the test case failure doesn't happen if there is a sufficient amount of RAM on the
testing node.  I also validated that a new node manager can work with an unpatched resource manager.
However, I could not get the health check script to fail and cause the node to be registered as unhealthy.

Here is my check script:
{code}
#!/bin/bash
echo "i am here" > /tmp/hello
exit 1
{code}

It would be nice to have a verbose message showing the exit code of the health check script
in the node manager log file.  The script is executed, but the node shows as healthy.  What am I
doing wrong?
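
For comparison, here is a sketch of a variant that reports failure on standard output instead of relying on the exit code. It assumes the convention described in the Hadoop cluster setup documentation, where a health script signals an unhealthy node by printing a line that begins with the string ERROR. The function name report_health and the message text are hypothetical, and whether the patched node manager honors this at registration time is not confirmed here.

```shell
#!/bin/bash
# Hypothetical variant of the check script above.
# Assumption: the node manager treats a stdout line that begins with
# "ERROR" as an unhealthy report (per the cluster setup documentation),
# rather than looking at the script's exit code.

report_health() {
  # $1: 0 if the (simulated) check passed, non-zero otherwise
  if [ "$1" -ne 0 ]; then
    echo "ERROR simulated health check failure"
  else
    echo "OK"
  fi
}

# Simulate a failing check so the script reports the node as unhealthy.
report_health 1
```

If the node is still reported as healthy with a script like this, the problem is more likely in how the script output is consumed than in the script itself.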

> NMs should supply a health status when registering with RM
> ----------------------------------------------------------
>
>                 Key: YARN-9809
>                 URL: https://issues.apache.org/jira/browse/YARN-9809
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Eric Badger
>            Assignee: Eric Badger
>            Priority: Major
>         Attachments: YARN-9809.001.patch, YARN-9809.002.patch, YARN-9809.003.patch, YARN-9809.004.patch
>
>
> Currently if the NM registers with the RM and it is unhealthy, it can be scheduled many
> containers before the first heartbeat. After the first heartbeat, the RM will mark the NM
> as unhealthy and kill all of the containers.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


