Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 63098 invoked from network); 29 May 2007 18:53:01 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 29 May 2007 18:53:01 -0000 Received: (qmail 57970 invoked by uid 500); 29 May 2007 18:53:04 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 57568 invoked by uid 500); 29 May 2007 18:53:04 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 57559 invoked by uid 99); 29 May 2007 18:53:04 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 29 May 2007 11:53:04 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of ghostwhoowalks@gmail.com designates 209.85.134.187 as permitted sender) Received: from [209.85.134.187] (HELO mu-out-0910.google.com) (209.85.134.187) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 29 May 2007 11:52:58 -0700 Received: by mu-out-0910.google.com with SMTP id g7so2021466muf for ; Tue, 29 May 2007 11:52:37 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=mzZQm4GpbpIQonqWkaLbul9A+DolDdAif6NAmbYsTAFN+CGzcPyOrNsBsaJJANQSlmUKHKKGdFufdAzoLemG02x84EkY0rSGo5pK7OTm+w7XEu8gTmaVr2U24XIHY0M9Bx1HLs4QIvTQl9J/1DNcIkqeSOLvIOiLc3Vvq5jqlso= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=F6wJnFhTPYj2nRN7f6zhSVjEm7rr3wbstTxFCNa5yQKCLh7tlZzNUTIhYtdRPvymCJldAus9zz2QUkNcLEKFcTiCeQvcWZ7U9EQBq2qmDdh8ryJ2pQp1XsgYaCrDnSkKGgtqVIejjejJvm0AC+S3hbCMQmxMh9xFtukge1KtR8g= Received: by 10.82.126.5 with SMTP id y5mr13155055buc.1180464755688; Tue, 29 May 2007 11:52:35 -0700 (PDT) Received: by 10.82.158.3 with HTTP; Tue, 29 May 2007 11:52:35 -0700 (PDT) Message-ID: <84ad79bb0705291152t980f6dfy4c0677ef07cf6898@mail.gmail.com> Date: Tue, 29 May 2007 11:52:35 -0700 From: Phantom To: hadoop-user@lucene.apache.org Subject: Re: Configuration and Hadoop cluster setup In-Reply-To: <84ad79bb0705291101tf8761b2v6e80b6e3805a4340@mail.gmail.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_134129_15178254.1180464755636" References: <001201c79e62$ebe27920$a8afe442@ds.corp.yahoo.com> <20070525072204.88BA710FB00E@herse.apache.org> <84ad79bb0705251337h7c7418e0xdb422a95a49c8e02@mail.gmail.com> <84ad79bb0705251419n51e15a81x4961c8f336510d80@mail.gmail.com> <465755C9.5060804@apache.org> <46576B67.5060601@yahoo-inc.com> <84ad79bb0705281313q73ed7511o9ab5334b3d2203dd@mail.gmail.com> <465C6929.8030503@apache.org> <84ad79bb0705291101tf8761b2v6e80b6e3805a4340@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_134129_15178254.1180464755636 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline Either I am totally confused or this configuration stuff is confusing the hell out of me. I am pretty sure it is the former. Please I am looking for advice here as to how I should do this. I have my fs.default.name set to hdfs://:. In my JobConf setup I set the set same value for my fs.default.name. Now I have two options and I would appreciate if some expert could tell me which option I should take and why ? (1) Set my fs.default.name set to hdfs://: and also specify it in the JobConf configuration. Copy my sample input file into HDFS using "bin/hadoop fd -put" from my local file system. I then need to specify this file to my WordCount sample as input. Should I specify this file with the hdfs:// directive ? (2) Set my fs.default.name set to file://: and also specify it in the JobConf configuration. Just specify the input path to the WordCount sample and everything should work if the path is available to all machines in the cluster ? Which way should I go ? Thanks Avinash On 5/29/07, Phantom wrote: > > Yes it is. > > Thanks > A > > > On 5/29/07, Doug Cutting wrote: > > > > Phantom wrote: > > > Is there a workaround ? I want to run the WordCount sample against a > > > file on > > > my local filesystem. If this is not possible do I need to put my file > > into > > > HDFS and then point that location to my program ? > > > > Is your local filesystem accessible to all nodes in your system? > > > > Doug > > > > ------=_Part_134129_15178254.1180464755636--