hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bharath Mundlapudi <bharathw...@yahoo.com>
Subject Re: check namenode, jobtracker, datanodes and tasktracker status
Date Fri, 08 Jul 2011 17:56:36 GMT
Shouldn't be a problem. But making sure, you disconnect the connection from this monitoring
client might be helpful at peak loads.


From: Marc Sturlese <marc.sturlese@gmail.com>
To: hadoop-user@lucene.apache.org
Sent: Friday, July 8, 2011 10:49 AM
Subject: check namenode, jobtracker, datanodes and tasktracker status

Hey there,
I've written some scripts to check dfs disk space, number of datanodes,
number of tasktrackers, heap in use...
I'm with hadoop 0.20.2 and to do that I use the DFSClient and JobClient
I do things like:

JobClient jc = new JobClient(socketJT, conf);
ClusterStatus clusterStatus = jc.getClusterStatus(true);
DFSClient client = new DFSClient(socketNN, conf);
DatanodeInfo[] dni = client.datanodeReport(DatanodeReportType.ALL);

FileSystem fs = FileSystem.get(new URI("hdfs://" + host + "/"), conf);

It's is working well but I'm worried it could be harmful for the cluster to
run the script continuously (resource consumer). Is it alrite for example to
run it every 10 o 15 minutes? In case not, which is a good practice to
monitor the cluster?

Thanks in advance.

View this message in context: http://lucene.472066.n3.nabble.com/check-namenode-jobtracker-datanodes-and-tasktracker-status-tp3152565p3152565.html
Sent from the Hadoop lucene-users mailing list archive at Nabble.com.
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message