From: Grant Ingersoll
To: hadoop-user@lucene.apache.org
Subject: Initialization costs
Date: Mon, 30 Oct 2006 08:47:22 -0500
Message-Id: <5CFC98AE-894B-4146-B76D-5B6EE13B2044@apache.org>

I know in general that I shouldn't worry too much about initialization costs, as they will be amortized over the life of the job and are often a
drop in the bucket time-wise. However, in my setup I have a conf() method that loads some resources from disk, and it currently does so once per job. Every node in my cluster needs these resources, and every job I submit ends up loading them again. Is there any way these resources could be loaded once, when the task tracker starts up? In some sense, this is akin to putting something into application scope in a webapp, as opposed to session scope.

Thanks,
Grant
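
The scoping distinction in the question can be sketched in plain Java. This is only an illustration of "application scope" as a per-JVM cache, not a Hadoop API: the class name SharedResources, the get() and loadCount() methods, and the loader callback are all hypothetical. It also assumes that the code asking for the resource runs repeatedly inside the same long-lived JVM. The message does not say whether tasks run inside the task tracker's process, and if each task gets a fresh JVM, a static cache like this would not survive between tasks.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

// Hypothetical helper: caches loaded resources for the lifetime of the JVM.
class SharedResources {
    // One cache per JVM, shared by every caller in the process
    // (the "application scope" in the webapp analogy).
    private static final Map<String, Object> CACHE = new ConcurrentHashMap<>();

    // Counts how many times a loader actually ran (for illustration only).
    private static final AtomicInteger LOADS = new AtomicInteger();

    // Returns the cached resource for this path, running the loader
    // only the first time the path is requested in this JVM.
    static Object get(String path, Function<String, Object> loader) {
        return CACHE.computeIfAbsent(path, p -> {
            LOADS.incrementAndGet();
            return loader.apply(p);
        });
    }

    static int loadCount() {
        return LOADS.get();
    }

    public static void main(String[] args) {
        // Two "jobs" asking for the same resource: the loader runs once.
        Object first = get("resources/dictionary.txt", p -> "contents of " + p);
        Object second = get("resources/dictionary.txt", p -> "contents of " + p);
        System.out.println(first == second);
        System.out.println(loadCount());
    }
}
```

A per-job conf() method would call SharedResources.get(...) instead of reading from disk directly, so the disk read happens once per process rather than once per job. This helps only if the process outlives the job.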