From: "Jonathan Gray"
Reply-To: hbase-user@hadoop.apache.org
Subject: RE: Bulk import question.
Date: Mon, 1 Dec 2008 19:20:14 -0800

Your new best friends: Ganglia and Nagios

Ganglia is great for monitoring cluster-wide resource usage over time.
You'll see memory, CPU, disk, and network usage over time for the entire
cluster and for each node. It is very easy to set up because it uses UDP
multicast by default, so there is no need to list the nodes in conf
files. HBase 0.19 introduces Ganglia metrics, which will also be
available in the Ganglia web interface.

http://ganglia.info/

Nagios is good for monitoring services as well as resource utilization.
Rather than giving data over time, its aim is really to alert you when
something is wrong: for example, when a server is no longer reachable
or when available disk space reaches a configurable threshold. It does
require a bit more work to get up and running because you have to set up
your node and service configurations. I have written custom Nagios
plugins for Hadoop and HBase; if there's interest, I will look at
cleaning them up and contributing them.

http://www.nagios.org/

Both are free and essential tools for properly monitoring your cluster.

JG

> -----Original Message-----
> From: edward@udanax.org [mailto:edward@udanax.org] On Behalf Of Edward
> J. Yoon
> Sent: Monday, December 01, 2008 7:04 PM
> To: apurtell@apache.org
> Cc: hbase-user@hadoop.apache.org; 02635@nhncorp.com
> Subject: Re: Bulk import question.
>
> I'm considering storing large-scale web-mail data in HBase.
> As you know, there are a lot of mail bombs (e.g. spam, group mail,
> etc.), so I tested these.
>
> Here's my additional question: do we have a monitoring tool for disk
> space?
>
> /Edward
>
> On Tue, Dec 2, 2008 at 11:42 AM, Andrew Purtell wrote:
> > Edward,
> >
> > You are running with insufficient resources -- too little CPU
> > for your task and too little disk for your data.
> >
> > If you are running a mapreduce task and DFS runs out of space
> > for the temporary files, then you indeed should expect
> > aberrant job status from the Hadoop job framework, for
> > example such things as completion status running backwards.
> >
> > I do agree that under these circumstances HBase daemons
> > should fail more gracefully, by entering some kind of
> > degraded read-only mode, if DFS is not totally dead. I
> > suspect this is already on a to-do list somewhere, and I
> > vaguely recall a jira filed on that topic.
> >
> > - Andy
> >
> >
> >> From: Edward J. Yoon
> >> Subject: Re: Bulk import question.
> >> To: hbase-user@hadoop.apache.org, apurtell@apache.org
> >> Date: Monday, December 1, 2008, 6:26 PM
> >> It was caused by a 'Datanode DiskOutOfSpaceException'. But I
> >> think the daemons should not die.
> >>
> >> On Wed, Nov 26, 2008 at 1:08 PM, Edward J. Yoon wrote:
> >> > Hmm. This happens to me often. I'll check the logs.
> >> >
> >> > On Fri, Nov 21, 2008 at 9:46 AM, Andrew Purtell wrote:
> >> > > I think a 2 node cluster is simply too small for
> >> > > the full load of everything.
>
> --
> Best Regards, Edward J. Yoon @ NHN, corp.
> edwardyoon@apache.org
> http://blog.udanax.org
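
P.S. To make the Nagios point about a configurable disk-space threshold concrete, here is a minimal sketch of what such a check might look like. It is not one of the Hadoop/HBase plugins mentioned in this message. The function names (classify, check_path, main) and the default thresholds (warn at 20% free, critical at 10% free) are assumptions chosen for illustration. The only convention relied on is the standard Nagios plugin exit-code scheme (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and Python's shutil.disk_usage.

```python
import shutil
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def classify(free_bytes, total_bytes, warn_pct=20.0, crit_pct=10.0):
    """Map free space to (status, message). Thresholds are percent free.

    The default thresholds are illustrative assumptions, not recommendations.
    """
    if total_bytes <= 0:
        return UNKNOWN, "DISK UNKNOWN - total size reported as 0"
    free_pct = 100.0 * free_bytes / total_bytes
    if free_pct <= crit_pct:
        return CRITICAL, "DISK CRITICAL - %.1f%% free" % free_pct
    if free_pct <= warn_pct:
        return WARNING, "DISK WARNING - %.1f%% free" % free_pct
    return OK, "DISK OK - %.1f%% free" % free_pct


def check_path(path, warn_pct=20.0, crit_pct=10.0):
    """Check the filesystem that holds `path` (hypothetical helper)."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        # The path is missing or unreadable, so report UNKNOWN, not OK.
        return UNKNOWN, "DISK UNKNOWN - %s" % exc
    return classify(usage.free, usage.total, warn_pct, crit_pct)


def main(argv):
    """Print one status line and return the Nagios exit code."""
    path = argv[1] if len(argv) > 1 else "/"
    status, message = check_path(path)
    print(message)
    return status

# A real plugin script would end with: sys.exit(main(sys.argv))
```

Nagios would run a script like this on a schedule and alert on the exit code. Pointing it at the DFS data directory of each datanode is one way to get a warning before a DiskOutOfSpaceException shows up in the logs.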