Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm
Reply-To: common-user@hadoop.apache.org
Subject: Re: Setting up a second cluster and getting a weird issue
From: Andrew Nguyen
Date: Fri, 14 May 2010 08:53:35 -0700
Message-Id: <0AAA5E6A-EB59-4C07-B76C-662480F1B268@ucsfcti.org>
References:
 <9FC73A39-1F56-4622-B83C-5963DB98B11E@ucsfcti.org>
To: common-user@hadoop.apache.org

Just to be clear, I'm only sharing the Hadoop binaries and config files via NFS. I don't see how this would cause a conflict - do you have any additional information?

The referenced path in the error below (/srv/hadoop/dfs/1) is not being shared via NFS...

Thanks,
Andrew

On May 13, 2010, at 6:51 PM, Jeff Zhang wrote:

> It is not suggested to deploy hadoop on NFS, there will be conflict
> between data nodes, because NFS share the same namespace of file
> system.
>
>
> On Thu, May 13, 2010 at 9:52 PM, Andrew Nguyen wrote:
>>
>> Yes, in this deployment, I'm attempting to share the hadoop files via NFS. The log and pid directories are local.
>>
>> Thanks!
>>
>> --Andrew
>>
>> On May 12, 2010, at 7:40 PM, Jeff Zhang wrote:
>>
>>> These 4 nodes share NFS ？
>>>
>>>
>>> On Thu, May 13, 2010 at 8:19 AM, Andrew Nguyen wrote:
>>>> I'm working on bringing up a second test cluster and am getting these intermittent errors on the DataNodes:
>>>>
>>>> 2010-05-12 17:17:15,094 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.FileNotFoundException: /srv/hadoop/dfs/1/current/VERSION (No such file or directory)
>>>>         at java.io.RandomAccessFile.open(Native Method)
>>>>         at java.io.RandomAccessFile.<init>(RandomAccessFile.java:212)
>>>>         at org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.write(Storage.java:249)
>>>>         at org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.write(Storage.java:243)
>>>>         at org.apache.hadoop.hdfs.server.common.Storage.writeAll(Storage.java:689)
>>>>         at org.apache.hadoop.hdfs.server.datanode.DataNode.register(DataNode.java:560)
>>>>         at org.apache.hadoop.hdfs.server.datanode.DataNode.runDatanodeDaemon(DataNode.java:1230)
>>>>         at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:1273)
>>>>         at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:1394)
>>>>
>>>>
>>>> There are 4 slaves and sometimes 1 or 2 have the error but the specific nodes change. Sometimes it's slave1, sometimes it's slave4, etc.
>>>>
>>>> Any thoughts?
>>>>
>>>> Thanks!
>>>>
>>>> --Andrew
>>>
>>>
>>> --
>>> Best Regards
>>>
>>> Jeff Zhang
>>
>
>
> --
> Best Regards
>
> Jeff Zhang
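
P.S. One way to double-check on each slave that the data directory from the error (/srv/hadoop/dfs/1) really sits on local disk, and that only the Hadoop install is on NFS, is to look up which mount backs each path. Below is a minimal sketch, assuming a Linux-style mount table in the /proc/mounts format. The helper names (parse_mounts, find_mount, is_nfs), the sample mount table, and the /opt/hadoop install path are all made up for illustration; none of them come from Hadoop or from the actual cluster.

```python
import posixpath


def parse_mounts(text):
    """Parse /proc/mounts-style text into (mount_point, fs_type) pairs."""
    mounts = []
    for line in text.splitlines():
        fields = line.split()
        # Expected fields: device, mount point, fs type, options, dump, pass
        if len(fields) >= 3:
            mounts.append((fields[1], fields[2]))
    return mounts


def find_mount(path, mounts):
    """Return the (mount_point, fs_type) whose mount point is the longest
    prefix of path, or None if nothing matches."""
    path = posixpath.normpath(path)
    best = None
    for mount_point, fs_type in mounts:
        mp = posixpath.normpath(mount_point)
        covers = mp == "/" or path == mp or path.startswith(mp + "/")
        # ">=" lets a later entry for the same mount point win, since a
        # later mount shadows an earlier one.
        if covers and (best is None or len(mp) >= len(best[0])):
            best = (mp, fs_type)
    return best


def is_nfs(path, mounts):
    """True if the mount backing path has an NFS filesystem type (nfs, nfs4)."""
    mount = find_mount(path, mounts)
    return mount is not None and mount[1].startswith("nfs")


# Hypothetical mount table: install tree on NFS, data directory on local disk.
SAMPLE_MOUNTS = """\
/dev/sda1 / ext3 rw 0 0
filer:/export/hadoop /opt/hadoop nfs rw 0 0
/dev/sdb1 /srv/hadoop/dfs/1 ext3 rw 0 0
"""

mounts = parse_mounts(SAMPLE_MOUNTS)
for p in ("/opt/hadoop/conf", "/srv/hadoop/dfs/1/current/VERSION"):
    print(p, "->", find_mount(p, mounts), "nfs" if is_nfs(p, mounts) else "local")
```

On a real node the sample table would be replaced by the contents of /proc/mounts. If the data directory resolves to an NFS mount on any slave, that would match the conflict Jeff describes; if it is local everywhere, the cause of the missing VERSION file lies elsewhere.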