hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sreekanth Ramakrishnan (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-5478) Provide a node health check script and run it periodically to check the node health status
Date Mon, 15 Jun 2009 11:39:07 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-5478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sreekanth Ramakrishnan updated HADOOP-5478:
-------------------------------------------

    Attachment: hadoop-5478-3.patch

Attaching a patch which does following:

* Added a new protocol which is used for node health reporting.
* Added a new Class which implements this Interface.
* The new class spawns off a new JVM, which does the health monitoring.
* Moved the timeout logic into {{Shell}}
* Retained the old test case which does
 1. Black listing due to node script reporting error 
2. White-listing of same tracker when script returns no error 
3. Blacklisting when the script times out.

> Provide a node health check script and run it periodically to check the node health status
> ------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5478
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5478
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: Aroop Maliakkal
>            Assignee: Vinod K V
>         Attachments: hadoop-5478-1.patch, hadoop-5478-2.patch, hadoop-5478-3.patch
>
>
> Hadoop must have some mechanism to find the health status of a node . It should run the
health check script periodically and if there is any errors, it should black list the node.
This will be really helpful when we run static mapred clusters. Else we may have to run some
scripts/daemons periodically to find the node status and take it offline manually.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message