Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 26112 invoked from network); 30 Apr 2009 22:22:55 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 30 Apr 2009 22:22:55 -0000 Received: (qmail 93554 invoked by uid 500); 30 Apr 2009 22:22:55 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 93468 invoked by uid 500); 30 Apr 2009 22:22:55 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 93458 invoked by uid 99); 30 Apr 2009 22:22:55 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Apr 2009 22:22:55 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of msa@schor.com designates 69.56.142.19 as permitted sender) Received: from [69.56.142.19] (HELO gateway01.websitewelcome.com) (69.56.142.19) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 30 Apr 2009 22:22:45 +0000 Received: (qmail 31211 invoked from network); 30 Apr 2009 22:24:03 -0000 Received: from gator74.hostgator.com (67.18.27.130) by gateway01.websitewelcome.com with SMTP; 30 Apr 2009 22:24:03 -0000 Received: from [129.34.20.19] (port=16946 helo=[9.2.34.89]) by gator74.hostgator.com with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.69) (envelope-from ) id 1Lzee5-0002N1-Jq for general@hadoop.apache.org; Thu, 30 Apr 2009 17:22:21 -0500 Message-ID: <49FA249E.3030106@schor.com> Date: Thu, 30 Apr 2009 18:22:22 -0400 From: Marshall Schor User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: general@hadoop.apache.org Subject: classpath for finding Key classes X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - gator74.hostgator.com X-AntiAbuse: Original Domain - hadoop.apache.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - schor.com X-Virus-Checked: Checked by ClamAV on apache.org In hadoop, one can define the classes to be used for Keys and Values. I do this. When I make my giant Jar file holding everything needed for running my application, I include these classes. However, I've discovered that that is not enough it seems (in 0.19.1 version - in case that matters :-) ). The job start up processes is reading the configuration and finding the names of my Key classes, and tries to load them. But it is not using the giant Jar for my job, (yet), so it doesn't find them. A work-around that I've found is to include my giant Jar as the argument to -libjars - that seems to get the class path set up so the startup / validation code can find my classes. This seems wasteful - having the giant jar in two places... Is there a best practices way to do this that's better than this? Thanks. -Marshall Schor