Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 88674 invoked from network); 19 May 2008 13:39:59 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 19 May 2008 13:39:59 -0000 Received: (qmail 98464 invoked by uid 500); 19 May 2008 13:40:01 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 97487 invoked by uid 500); 19 May 2008 13:39:57 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 97476 invoked by uid 99); 19 May 2008 13:39:57 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 May 2008 06:39:57 -0700 X-ASF-Spam-Status: No, hits=-2.8 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [192.6.10.60] (HELO tobor.hpl.hp.com) (192.6.10.60) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 May 2008 13:39:09 +0000 Received: from localhost (localhost [127.0.0.1]) by tobor.hpl.hp.com (Postfix) with ESMTP id D013BB7BC0 for ; Mon, 19 May 2008 14:39:03 +0100 (BST) X-Virus-Scanned: amavisd-new at hplb.hpl.hp.com Received: from tobor.hpl.hp.com ([127.0.0.1]) by localhost (tobor.hpl.hp.com [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 4uD1-pt+njGj for ; Mon, 19 May 2008 14:38:56 +0100 (BST) Received: from ha-node-br2.hpl.hp.com (ha-node-br2.hpl.hp.com [16.25.144.57]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by tobor.hpl.hp.com (Postfix) with ESMTPS id 97887B7B4E for ; Mon, 19 May 2008 14:38:50 +0100 (BST) MailScanner-NULL-Check: 1211809109.1778@YQeJj+mgog0MT9rWw8k/xg Received: from [16.25.171.118] (morzine.hpl.hp.com [16.25.171.118]) by ha-node-br2.hpl.hp.com (8.14.1/8.13.4) with ESMTP id m4JDcR8d006280 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Mon, 19 May 2008 14:38:27 +0100 (BST) Message-ID: <483182D9.3080804@apache.org> Date: Mon, 19 May 2008 14:38:33 +0100 From: Steve Loughran User-Agent: Thunderbird 2.0.0.14 (X11/20080421) MIME-Version: 1.0 To: core-user@hadoop.apache.org Subject: Re: How do people keep their client configurations in sync with the remote cluster(s) References: <482D4DD2.7090707@apache.org> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-HPL-MailScanner-Information: Please contact the ISP for more information X-MailScanner-ID: m4JDcR8d006280 X-HPL-MailScanner: Found to be clean X-HPL-MailScanner-From: stevel@apache.org X-Virus-Checked: Checked by ClamAV on apache.org Alejandro Abdelnur wrote: > A while ago I've opened an issue related to this topic > > https://issues.apache.org/jira/browse/HADOOP-3287 > > My take is a little different, when submitting a job, the clients > should only send to the jobtracker the configuration they explicitly > set, then the job tracker would apply the defaults for all the other > configuration. > > By doing this the cluster admin can modify things at any time and > changes on default values take effect for all clients without having > to distribute a new configuration to all clients. > > IMO, this approach was the intended behavior at some point, according > to the Configuration.write(OutputStream) javadocs ' Writes non-default > properties in this configuration.'. But as the write method is writing > default properties this is not happening. I'll keep an eye on that issue. I think a key problem right now is that clients take their config from the configuration file in the core jar, and from their own settings, You need to keep the settings in sync somehow, and have to take what the core jar provides. > This approach would also get rid of the separate mechanism (zookeeper, > svn, etc) to keep clients synchronized as there would be no need to do > so. zookeeper and similar are to keep the cluster alive; they shouldnt be needed for clients, which should only need some URL of a job tracker to talk to.