Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 62834 invoked from network); 18 Apr 2008 22:03:26 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 18 Apr 2008 22:03:26 -0000 Received: (qmail 33809 invoked by uid 500); 18 Apr 2008 22:03:20 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 33777 invoked by uid 500); 18 Apr 2008 22:03:20 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 33768 invoked by uid 99); 18 Apr 2008 22:03:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Apr 2008 15:03:20 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mbautin@gmail.com designates 72.14.220.159 as permitted sender) Received: from [72.14.220.159] (HELO fg-out-1718.google.com) (72.14.220.159) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Apr 2008 22:02:28 +0000 Received: by fg-out-1718.google.com with SMTP id 16so700997fgg.35 for ; Fri, 18 Apr 2008 15:02:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type; bh=ZI4Mj1Qis+MktqEflrA1zTUSRqKxkJeIWxuG6rIn4AQ=; b=G3gHyqIKKqxoy8b1xvSgtFMq5jNG5Bi4DH7A14pTI+Ndr2CT3JHR5z43JbnfxfT68Op0K3LNIgivAKTEVv5JCKtno1O5TgJV6Z4zAOW5gzoGeLBp4flW4SmjlaPXe3ayHpA+Rw5FKU28A2R65S4JozaTSLqjmZRidTloP6gVXFk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:mime-version:content-type; b=iudmAOX+ybryGmov0xHn+OQTi4py/rYSX/EFfaVzPYeCzQGZHDnq68gYaF35R9OEGoqVhKKVMfaHCp0SK0t4+t/blfTuuCbjub8IEDFsN5nmboS5j3KnWhlZqCGV++JC3lQBDjSE2R1E8gTHati1IM/XPeAwy2hw4Gwqolv+yU8= Received: by 10.82.120.18 with SMTP id s18mr5623944buc.8.1208556169198; Fri, 18 Apr 2008 15:02:49 -0700 (PDT) Received: by 10.82.152.13 with HTTP; Fri, 18 Apr 2008 15:02:49 -0700 (PDT) Message-ID: Date: Fri, 18 Apr 2008 18:02:49 -0400 From: "Mikhail Bautin" To: core-user@hadoop.apache.org Subject: jar files on NFS instead of DistributedCache MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_6451_1049215.1208556169174" X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_6451_1049215.1208556169174 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hello, We are using Hadoop here at Stony Brook University to power the next-generation text analytics backend for www.textmap.com. We also have an NFS partition that is mounted on all machines of our 100-node cluster. I found it much more convenient to store manually created files (e.g. configuration) on the NFS partition and just use them from my mappers and reducers rather than copying them to HDFS every time I change them, which is necessary when using DistributedCache. Is there a way to do the same for jars? Specifically, I just need a way to alter the child JVM's classpath via JobConf, without having the framework copy anything in and out of HDFS, because all my files are already accessible from all nodes. I see how to do that by adding a couple of lines to TaskRunner's run() method, e.g.: classPath.append(sep); classPath.append(conf.get("mapred.additional.classpath")); or something similar. Is there already such a feature or should I just go ahead and implement it? Thanks, Mikhail Bautin ------=_Part_6451_1049215.1208556169174--