Return-Path: Delivered-To: apmail-hadoop-chukwa-user-archive@minotaur.apache.org Received: (qmail 59110 invoked from network); 4 Dec 2009 01:59:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Dec 2009 01:59:15 -0000 Received: (qmail 17634 invoked by uid 500); 4 Dec 2009 01:59:15 -0000 Delivered-To: apmail-hadoop-chukwa-user-archive@hadoop.apache.org Received: (qmail 17597 invoked by uid 500); 4 Dec 2009 01:59:15 -0000 Mailing-List: contact chukwa-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: chukwa-user@hadoop.apache.org Delivered-To: mailing list chukwa-user@hadoop.apache.org Received: (qmail 17588 invoked by uid 99); 4 Dec 2009 01:59:15 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Dec 2009 01:59:15 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [69.147.107.20] (HELO mrout1-b.corp.re1.yahoo.com) (69.147.107.20) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Dec 2009 01:59:03 +0000 Received: from SNV-EXBH01.ds.corp.yahoo.com (snv-exbh01.ds.corp.yahoo.com [207.126.227.249]) by mrout1-b.corp.re1.yahoo.com (8.13.8/8.13.8/y.out) with ESMTP id nB41wNRh073166 for ; Thu, 3 Dec 2009 17:58:23 -0800 (PST) DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=received:user-agent:date:subject:from:to:message-id: thread-topic:thread-index:in-reply-to:mime-version:content-type: content-transfer-encoding:return-path:x-originalarrivaltime; b=AxB242mFw9QZqXkGrO65GhGc3k+4pOfHEa7vf/8NTyJRxWq90eTT8ifeAyBVR/wj Received: from SNV-EXVS06.ds.corp.yahoo.com ([207.126.227.234]) by SNV-EXBH01.ds.corp.yahoo.com with Microsoft SMTPSVC(6.0.3790.3959); Thu, 3 Dec 2009 17:58:23 -0800 Received: from 10.72.111.153 ([10.72.111.153]) by SNV-EXVS06.ds.corp.yahoo.com ([207.126.227.82]) via Exchange Front-End Server snv-webmail.corp.yahoo.com ([207.126.227.59]) with Microsoft Exchange Server HTTP-DAV ; Fri, 4 Dec 2009 01:58:22 +0000 User-Agent: Microsoft-Entourage/12.23.0.091001 Date: Thu, 03 Dec 2009 17:58:21 -0800 Subject: Re: What's Chukwa for? From: Eric Yang To: Message-ID: Thread-Topic: What's Chukwa for? Thread-Index: Acp0hUFjv+pkZaKuQU6vx5C2lFBlYQ== In-Reply-To: <4B184CFB.8020400@googlemail.com> Mime-version: 1.0 Content-type: text/plain; charset="US-ASCII" Content-transfer-encoding: 7bit X-OriginalArrivalTime: 04 Dec 2009 01:58:23.0082 (UTC) FILETIME=[42A14CA0:01CA7485] X-Virus-Checked: Checked by ClamAV on apache.org Hi MJ, When Chukwa Agent was introduced in the design, it was designed to be both configuration management and monitoring agent. However, the current agent implementation does not have configuration management capabilities nor limit developer to add them. The community is currently focused on monitoring agent for now. Regards, Eric On 12/3/09 3:42 PM, "MJ Lai" wrote: > > Eric, Ari. > > Thanks for your responses. > > I have some extended questions. I'm trying to figure out what is the > best way to manage a production hadoop cluster, features include: > - deployment: install hadoop from one console; > - monitoring > - configure > - log/system analytics > - software upgrade > - etc. > > It seems Ganglia/Nagio are widely used to monitor hadoop cluster > metrics, and Chukwa is for log analytics. But still, it is a pain to > manage/configure a hadoop cluster. > > Since Chukwa has an agent installed in each endpoint, do you have any > plan to build it as a universal platform for hadoop management? > > Thanks. > MJ > > > Eric Yang wrote: >> Chukwa is a generic distributed log processing system. It's primary use >> case is to monitor Hadoop cluster. There are several analytics bundled for >> displaying system state, java vm resource usage, Hadoop dfs, mapreduce >> metrics. However, anyone could add their own analytics system to run in >> Chukwa. >> >> In general, the monitoring system is usually independent from the subject >> which being monitored. Chukwa documentation might look like you need two >> clusters for this to work. However, it's actually possible for Chukwa to >> run on the same cluster as it's monitoring. >> >> It's better to call chukwa as a reporting system if Chukwa is running on the >> same cluster. If hadoop crashed in this type of deployment, chukwa would >> not be responsible for not alerting. >> >> Regards, >> Eric >> >> On 12/2/09 3:30 PM, "MJ Lai" wrote: >> >>> >>> Hi. >>> >>> It is another ``what for'' question. >>> >>> I went thru the chukwa web site and am still kind of confused by what is >>> software really for. Can I say the major purpose is to provide 1) a >>> generic distributed log processing system, or 2) this log system is only >>> for Hadoop cluster? In case of 1), why do we want to make it tightly >>> bound to Hadoop? >>> >>> Assume we have a 100-machine cluster (no hadoop), if I deploy Chukwa to >>> process the cluster logs, I still need to create another hadoop cluster >>> to make it work? >>> >>> I think some practical use cases could reduce the confusions of this >>> this project. >>> >>> Thanks. >>> MJ >>> >> >>