hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jamack, Peter" <PJam...@consilium1.com>
Subject Re: HDFS Reporting Tools
Date Tue, 06 Mar 2012 19:02:31 GMT
You could set up things like Ganglia, Nagios to monitor, send off events,
issues.
Within the Hadoop Ecosystem, there are things like Vaidya, maybe
Ambari(not sure as I've not used this), Splunk even has a new beta test
for Shep/Splunk Hadoop Monitoring app.

Peter Jamack

On 3/6/12 8:35 AM, "Oren Livne" <livne@uchicago.edu> wrote:

>Dear All,
>
>We are maintaining a 60-node hadoop cluster for external users, and
>would like to be automatically notified via email when an HDFS crash or
>some other infrastructure failure occurs that is not due to a user
>programming error. We've been encountering such "soft" errors, where
>hadoop does not crash, but becomes very slow and job hand for a long
>time and fail.
>
>Are there existing tools that provide this capability? Or do we have to
>manually monitor the web services at on http://namenode and
>http://namenode:50030?
>
>Thank you so much,
>Oren
>
>-- 
>"We plan ahead, which means we don't do anything right now."
>                                               -- Valentine (Tremors)
>
>-- 
>"We plan ahead, which means we don't do anything right now."
>                                               -- Valentine (Tremors)
>


Mime
View raw message