Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 31267 invoked from network); 12 Apr 2011 11:13:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 12 Apr 2011 11:13:27 -0000 Received: (qmail 3189 invoked by uid 500); 12 Apr 2011 11:13:20 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 2903 invoked by uid 500); 12 Apr 2011 11:13:20 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 2882 invoked by uid 99); 12 Apr 2011 11:13:19 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Apr 2011 11:13:19 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of praveen.peddi@nokia.com designates 147.243.128.26 as permitted sender) Received: from [147.243.128.26] (HELO mgw-da02.nokia.com) (147.243.128.26) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Apr 2011 11:13:13 +0000 Received: from vaebh101.NOE.Nokia.com (vaebh101.europe.nokia.com [10.160.244.22]) by mgw-da02.nokia.com (Switch-3.4.3/Switch-3.4.3) with ESMTP id p3CBCgBW007099; Tue, 12 Apr 2011 14:12:52 +0300 Received: from smtp.mgd.nokia.com ([65.54.30.8]) by vaebh101.NOE.Nokia.com over TLS secured channel with Microsoft SMTPSVC(6.0.3790.4675); Tue, 12 Apr 2011 14:12:17 +0300 Received: from 008-AM1MMR1-003.mgdnok.nokia.com (65.54.30.58) by NOK-AM1MHUB-04.mgdnok.nokia.com (65.54.30.8) with Microsoft SMTP Server (TLS) id 8.2.255.0; Tue, 12 Apr 2011 13:12:15 +0200 Received: from 008-AM1MPN1-013.mgdnok.nokia.com ([169.254.3.100]) by 008-AM1MMR1-003.mgdnok.nokia.com ([65.54.30.58]) with mapi id 14.01.0270.002; Tue, 12 Apr 2011 13:12:07 +0200 From: To: CC: Subject: Re: "Retrying connect to server" error while configuring hadoop Thread-Topic: "Retrying connect to server" error while configuring hadoop Thread-Index: AQHL+NZ5fDx9qBGiGUWQlzu+l6sCxJRaEz2o Date: Tue, 12 Apr 2011 11:12:06 +0000 Message-ID: <080AADC7-D146-44AD-B4DA-9117204757F0@nokia.com> References: <31376269.post@talk.nabble.com> In-Reply-To: <31376269.post@talk.nabble.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginalArrivalTime: 12 Apr 2011 11:12:17.0073 (UTC) FILETIME=[7BB26210:01CBF902] X-Nokia-AV: Clean Did you check if all ports are open between all nodes across the cluster? P= orts need to be open not just between master and slaves but also between sl= aves for data nodes to talk to each other. Praveen On Apr 12, 2011, at 1:57 AM, "ext prasunb" w= rote: >=20 > Hello,=20 >=20 > I am trying to configure Hadoop in fully distributed mode on three virtua= l > Fedora machines. During configuring I am not getting any error. Even when= I > am executing the script "start-dfs.sh", there aren't any error.=20 >=20 > But practically the namenode isn't able to connect the datanodes. These a= re > the error snippents from the "hadoop-root-datanode-hadoop2.log" files of > both datanodes....=20 >=20 > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=20 > ************************************************************ > STARTUP_MSG: Starting DataNode > STARTUP_MSG: host =3D hadoop2/127.0.0.1 > STARTUP_MSG: args =3D [] > STARTUP_MSG: version =3D 0.20.2-CDH3B4 > STARTUP_MSG: build =3D -r 3aa7c91592ea1c53f3a913a581dbfcdfebe98bfe; > compiled by 'root' on Mon Feb 21 17:31:12 EST 2011 > ************************************************************/ > 2011-04-08 15:33:03,537 WARN org.apache.hadoop.util.NativeCodeLoader: Una= ble > to load native-hadoop library for your platform... using builtin-java > classes where applicable > 2011-04-08 15:33:03,549 INFO > org.apache.hadoop.security.UserGroupInformation: JAAS Configuration alrea= dy > set up for Hadoop, not re-installing. > 2011-04-08 15:33:03,691 ERROR > org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException: Cal= l > to hadoop1/192.168.161.198:8020 failed on local exception: > java.io.IOException: Connection reset by peer > at org.apache.hadoop.ipc.Client.wrapException(Client.java:1139) > at org.apache.hadoop.ipc.Client.call(Client.java:1107) > at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:226) > at $Proxy4.getProtocolVersion(Unknown Source) > at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:398) > at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:342) > at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:317) > at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:297) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.startDataNode(DataNode.ja= va:338) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.(DataNode.java:280) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.jav= a:1527) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataN= ode.java:1467) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.j= ava:1485) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:= 1610) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:1620) > Caused by: java.io.IOException: Connection reset by peer > at sun.nio.ch.FileDispatcher.read0(Native Method) > at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:21) > at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:202) > at sun.nio.ch.IOUtil.read(IOUtil.java:175) > at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:243) > at > org.apache.hadoop.net.SocketInputStream$Reader.performIO(SocketInputStrea= m.java:55) > at > org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:1= 42) > at > org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:155) > at > org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:128) > at java.io.FilterInputStream.read(FilterInputStream.java:116) > at > org.apache.hadoop.ipc.Client$Connection$PingInputStream.read(Client.java:= 375) > at java.io.BufferedInputStream.fill(BufferedInputStream.java:218) > at java.io.BufferedInputStream.read(BufferedInputStream.java:237) > at java.io.DataInputStream.readInt(DataInputStream.java:370) > at > org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:812) > at org.apache.hadoop.ipc.Client$Connection.run(Client.java:720) >=20 > 2011-04-08 15:33:03,692 INFO > org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG: > /************************************************************ > SHUTDOWN_MSG: Shutting down DataNode at hadoop2/127.0.0.1 > ************************************************************/ > 2011-04-08 15:47:46,416 INFO > org.apache.hadoop.hdfs.server.datanode.DataNode: STARTUP_MSG: > /************************************************************ > STARTUP_MSG: Starting DataNode > STARTUP_MSG: host =3D hadoop2/127.0.0.1 > STARTUP_MSG: args =3D [] > STARTUP_MSG: version =3D 0.20.2-CDH3B4 > STARTUP_MSG: build =3D -r 3aa7c91592ea1c53f3a913a581dbfcdfebe98bfe; > compiled by 'root' on Mon Feb 21 17:31:12 EST 2011 > ************************************************************/ > 2011-04-08 15:47:46,738 INFO > org.apache.hadoop.security.UserGroupInformation: JAAS Configuration alrea= dy > set up for Hadoop, not re-installing. > 2011-04-08 15:47:47,839 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 0 time(s). > 2011-04-08 15:47:48,849 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 1 time(s). > 2011-04-08 15:47:49,859 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 2 time(s). > 2011-04-08 15:47:50,869 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 3 time(s). > 2011-04-08 15:47:51,878 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 4 time(s). > 2011-04-08 15:47:52,889 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 5 time(s). > 2011-04-08 15:47:53,900 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 6 time(s). > 2011-04-08 15:47:54,908 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 7 time(s). > 2011-04-08 15:47:55,917 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 8 time(s). > 2011-04-08 15:47:56,926 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 9 time(s). > 2011-04-08 15:47:56,928 INFO org.apache.hadoop.ipc.RPC: Server at > hadoop1/192.168.161.198:8020 not available yet, Zzzzz... > 2011-04-08 15:47:58,944 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 0 time(s). > 2011-04-08 15:47:59,953 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 1 time(s). > 2011-04-08 15:48:00,961 INFO org.apache.hadoop.ipc.Client: Retrying conne= ct > to server: hadoop1/192.168.161.198:8020. Already tried 2 time(s). >=20 > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D >=20 > Can anyone please help me to understand the problem.=20 >=20 > Thanks in advance. >=20 > --=20 > View this message in context: http://old.nabble.com/%22Retrying-connect-t= o-server%22-error-while-configuring-hadoop-tp31376269p31376269.html > Sent from the Hadoop core-user mailing list archive at Nabble.com. >=20