Return-Path: X-Original-To: apmail-hadoop-general-archive@minotaur.apache.org Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D95D5E863 for ; Sat, 5 Jan 2013 09:30:03 +0000 (UTC) Received: (qmail 45736 invoked by uid 500); 5 Jan 2013 09:30:01 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 45153 invoked by uid 500); 5 Jan 2013 09:29:55 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 45114 invoked by uid 99); 5 Jan 2013 09:29:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Jan 2013 09:29:54 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ablozhou@gmail.com designates 209.85.215.48 as permitted sender) Received: from [209.85.215.48] (HELO mail-la0-f48.google.com) (209.85.215.48) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Jan 2013 09:29:47 +0000 Received: by mail-la0-f48.google.com with SMTP id ej20so10939186lab.21 for ; Sat, 05 Jan 2013 01:29:27 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to:content-type; bh=Nc2/J8kQftpVh76z6D8Yjgt696n+ZvtXsI2gQNrRWyQ=; b=of5pUu36qf2JRhcNSNRJuAbpxpW5mcY3pnHGY2ZMg7u3/RYfU8uJdUWg5lLELAmYe+ iyo5EphgD2H9wPgw5ciQlD67rb1u0GqzaYFr3wP5vSE6ZOfx6ir7bd8ImielSIk30Hje flZ7peRF8wc6ViOAc8GTj6ZXvXNVnpp4P4dflH0ySLqXjzd8ihjAQy6n9A3wABpUTY38 OGzlXhcsWiYMpy6RxpvAvWlKAdO6jjiNOuttA91ZYL/5XC4YhM+6EWBtFw+08Od1l2Gd MXHUlX6OMYMYQgJ4VF5ldL9scy8TqPQIG+RxR+yFqmKqlya5MME8ThYm2Iwp8+zX1PBk 5sHg== Received: by 10.152.46.161 with SMTP id w1mr52499109lam.27.1357378167494; Sat, 05 Jan 2013 01:29:27 -0800 (PST) MIME-Version: 1.0 Received: by 10.112.87.231 with HTTP; Sat, 5 Jan 2013 01:29:06 -0800 (PST) From: =?UTF-8?B?5ZGo5qKm5oOz?= Date: Sat, 5 Jan 2013 17:29:06 +0800 Message-ID: Subject: hadoop simple monitor To: general@hadoop.apache.org Content-Type: multipart/alternative; boundary=bcaec55408907fd7ff04d2873bbd X-Virus-Checked: Checked by ClamAV on apache.org --bcaec55408907fd7ff04d2873bbd Content-Type: text/plain; charset=ISO-8859-1 Hello, I write a bash script to monitor all nodes of Hadoop system. It can monitor memory, disk usage and hadoop,thirift,hbase process exit. Simple Monitor for Hadoop Processes, Memory and Diskspace usage. If memory and disk usage reaches to a threshold, or some hadoop system process exit, the program will send email to make warnings. Features: 1. Simple Monitor Hadoop Processes, Memory and Diskspace usage of every hosts that configured. 2. Deploy is simple. Just copy or uncompress the code to one station, and it can monitor each remote hosts.( Do not need to deploy to any nodes to be monitored.) 3. Write by Bash script, It's easy to modify. 4. It can monitor java processes(Such as HBase,Thrift,Hadoop processes) or other processes(need a little change) Deploy: Just copy or uncompress the code to a directory, configured to monitor all remote hosts, the remote hosts needn't any configure or deploy. 1. modify config.sh to configure parameters 2. check loopcheck.sh to modify paths, because crontab has a limited env and PATH 3. check email sending is ok or not, you may need configure MTA such as Postfix 4. configure crontab, every 5 minutes check the hosts 0-59/5 * * * * $HOME/smr/loopcheck.sh Download: https://code.google.com/p/hadoop-simple-monitor/ May it could help some body. Best Regards, Andy Zhou --bcaec55408907fd7ff04d2873bbd--