Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 26271DDB2 for ; Wed, 8 Aug 2012 21:40:04 +0000 (UTC) Received: (qmail 32057 invoked by uid 500); 8 Aug 2012 21:39:59 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 31920 invoked by uid 500); 8 Aug 2012 21:39:59 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 31911 invoked by uid 99); 8 Aug 2012 21:39:59 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Aug 2012 21:39:59 +0000 X-ASF-Spam-Status: No, hits=1.8 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FSL_RCVD_USER,HTML_MESSAGE,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of arjunreddy0768@gmail.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qc0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Aug 2012 21:39:51 +0000 Received: by qcsc21 with SMTP id c21so993096qcs.35 for ; Wed, 08 Aug 2012 14:39:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=yY4n6F/LaqqPBZWiVjHGCor6JGrCUBXs5OyYkT5Hw8Y=; b=hekK3B1/vKzXXpLT/PUNpzuyiLOATnuKU8yebFGOlY1zdQt3NSrBXbV2rWoKZ0x1FN P28y0Egq4PuRT68iSZm/ayiXGmfeBIgPDZrp90FUutq/Vend5DKeqBylMNq+JCkaep8c Hg7Ikg2qWRtcidycbgvrE2lGAFs04iVUVv5bbu0voBO8hqCRdSvR/w/6I7l9I1hjTvR+ 0O3rv6jp7fKDq9DAGp01qSQgn0KzOwEwDQn8npnHMMIMxZxtVgreTBeUpJAlg4ZlN9Ov gvDRJkKrWq/o23Ksw4eiv1xREujBcbzCPtv98Ghydc76FK8IxRM4cmTbGpRNHIMIYIQV OggQ== MIME-Version: 1.0 Received: by 10.224.105.205 with SMTP id u13mr32671825qao.54.1344461970192; Wed, 08 Aug 2012 14:39:30 -0700 (PDT) Received: by 10.49.38.37 with HTTP; Wed, 8 Aug 2012 14:39:30 -0700 (PDT) Date: Wed, 8 Aug 2012 16:39:30 -0500 Message-ID: Subject: Problem running PI example in Hadoop 2.0.0 From: Arjun Reddy To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=20cf3071d13625b97204c6c7f26e --20cf3071d13625b97204c6c7f26e Content-Type: text/plain; charset=ISO-8859-1 I am trying to setup a small cluster using hadoop 2.0.0 and using PI example to validate the setup. When I have 1 master and 1 slave the example works fine. I am getting exceptions with the PI example when additional slave nodes are added to the cluster. The syslogs for failed tasks are as follows. Any ideas why this is happening. 2012-08-08 15:41:19,914 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.retry.interval; Ignoring. 2012-08-08 15:41:19,915 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.attempts; Ignoring. 2012-08-08 15:41:19,973 WARN [main] org.apache.hadoop.security.authentication.util.KerberosName: Kerberos krb5 configuration not found, setting default realm to empty 2012-08-08 15:41:20,142 INFO [main] org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properties 2012-08-08 15:41:20,221 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot period at 10 second(s). 2012-08-08 15:41:20,221 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask metrics system started 2012-08-08 15:41:20,377 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final parameter: dfs.namenode.name.dir; Ignoring. 2012-08-08 15:41:20,378 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.retry.interval; Ignoring. 2012-08-08 15:41:20,378 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final parameter: dfs.datanode.data.dir; Ignoring. 2012-08-08 15:41:20,378 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.attempts; Ignoring. 2012-08-08 15:41:20,435 INFO [main] org.apache.hadoop.mapred.YarnChild: Sleeping for 0ms before retrying again. Got null now. 2012-08-08 15:41:21,483 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 0 time(s). 2012-08-08 15:41:22,484 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 1 time(s). 2012-08-08 15:41:23,484 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 2 time(s). 2012-08-08 15:41:24,485 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 3 time(s). 2012-08-08 15:41:25,486 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 4 time(s). 2012-08-08 15:41:26,486 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 5 time(s). 2012-08-08 15:41:27,487 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 6 time(s). 2012-08-08 15:41:28,488 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 7 time(s). 2012-08-08 15:41:29,488 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 8 time(s). 2012-08-08 15:41:30,489 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 9 time(s). 2012-08-08 15:41:30,492 WARN [main] org.apache.hadoop.mapred.YarnChild: Exception running child : java.net.ConnectException: Call From node2/ 127.0.1.1 to node2:45965 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: http://wiki.apache.org/hadoop/ConnectionRefused at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:727) at org.apache.hadoop.ipc.Client.call(Client.java:1165) at org.apache.hadoop.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:224) at $Proxy6.getTask(Unknown Source) at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:123) Caused by: java.net.ConnectException: Connection refused at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:701) at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:524) at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489) at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:472) at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:566) at org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:215) at org.apache.hadoop.ipc.Client.getConnection(Client.java:1271) at org.apache.hadoop.ipc.Client.call(Client.java:1141) ... 3 more 2012-08-08 15:41:30,493 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Stopping MapTask metrics system... 2012-08-08 15:41:30,494 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask metrics system stopped. 2012-08-08 15:41:30,494 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask metrics system shutdown complete. --20cf3071d13625b97204c6c7f26e Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable

I am trying to setup a small cluster using hadoop 2.0.0 and using PI exa= mple to validate the setup.=A0 When I have 1 master and 1 slave the example= works fine.=A0 I am getting exceptions with the PI example when additional= slave nodes are added to the cluster.=A0 The syslogs for failed=A0 tasks a= re as follows.=A0 Any ideas why this is happening.

2012-08-08 15:41:19,914 WARN [main] org.apache.hadoop.conf.Configuration= : job.xml:an attempt to override final parameter: mapreduce.job.end-notific= ation.max.retry.interval;=A0 Ignoring.
2012-08-08 15:41:19,915 WARN [mai= n] org.apache.hadoop.conf.Configuration: job.xml:an attempt to override fin= al parameter: mapreduce.job.end-notification.max.attempts;=A0 Ignoring.
2012-08-08 15:41:19,973 WARN [main] org.apache.hadoop.security.authenticati= on.util.KerberosName: Kerberos krb5 configuration not found, setting defaul= t realm to empty
2012-08-08 15:41:20,142 INFO [main] org.apache.hadoop.m= etrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properti= es
2012-08-08 15:41:20,221 INFO [main] org.apache.hadoop.metrics2.impl.Metrics= SystemImpl: Scheduled snapshot period at 10 second(s).
2012-08-08 15:41:= 20,221 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTa= sk metrics system started
2012-08-08 15:41:20,377 WARN [main] org.apache.hadoop.conf.Configuration: j= ob.xml:an attempt to override final parameter: dfs.namenode.name.dir;=A0 Ig= noring.
2012-08-08 15:41:20,378 WARN [main] org.apache.hadoop.conf.Confi= guration: job.xml:an attempt to override final parameter: mapreduce.job.end= -notification.max.retry.interval;=A0 Ignoring.
2012-08-08 15:41:20,378 WARN [main] org.apache.hadoop.conf.Configuration: j= ob.xml:an attempt to override final parameter: dfs.datanode.data.dir;=A0 Ig= noring.
2012-08-08 15:41:20,378 WARN [main] org.apache.hadoop.conf.Confi= guration: job.xml:an attempt to override final parameter: mapreduce.job.end= -notification.max.attempts;=A0 Ignoring.
2012-08-08 15:41:20,435 INFO [main] org.apache.hadoop.mapred.YarnChild: Sle= eping for 0ms before retrying again. Got null now.
2012-08-08 15:41:21,4= 83 INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: no= de2/127.0.1.1:45965. Already tried 0= time(s).
2012-08-08 15:41:22,484 INFO [main] org.apache.hadoop.ipc.Client: Retrying = connect to server: node2/127.0.1.1:45965= . Already tried 1 time(s).
2012-08-08 15:41:23,484 INFO [main] org.a= pache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 2 time(s).
2012-08-08 15:41:24,485 INFO [main] org.apache.hadoop.ipc.Client: Retrying = connect to server: node2/127.0.1.1:45965= . Already tried 3 time(s).
2012-08-08 15:41:25,486 INFO [main] org.a= pache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 4 time(s).
2012-08-08 15:41:26,486 INFO [main] org.apache.hadoop.ipc.Client: Retrying = connect to server: node2/127.0.1.1:45965= . Already tried 5 time(s).
2012-08-08 15:41:27,487 INFO [main] org.a= pache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 6 time(s).
2012-08-08 15:41:28,488 INFO [main] org.apache.hadoop.ipc.Client: Retrying = connect to server: node2/127.0.1.1:45965= . Already tried 7 time(s).
2012-08-08 15:41:29,488 INFO [main] org.a= pache.hadoop.ipc.Client: Retrying connect to server: node2/127.0.1.1:45965. Already tried 8 time(s).
2012-08-08 15:41:30,489 INFO [main] org.apache.hadoop.ipc.Client: Retrying = connect to server: node2/127.0.1.1:45965= . Already tried 9 time(s).
2012-08-08 15:41:30,492 WARN [main] org.a= pache.hadoop.mapred.YarnChild: Exception running child : java.net.ConnectEx= ception: Call From node2/127.0.1.1 to node= 2:45965 failed on connection exception: java.net.ConnectException: Connecti= on refused; For more details see:=A0 http://wiki.apache.org/hadoop/ConnectionRefused =A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.net.NetUtils.wrapException(NetUt= ils.java:727)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.ipc.Client.call= (Client.java:1165)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.ipc.Writab= leRpcEngine$Invoker.invoke(WritableRpcEngine.java:224)
=A0=A0=A0=A0=A0=A0=A0 at $Proxy6.getTask(Unknown Source)
=A0=A0=A0=A0=A0= =A0=A0 at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:123)
Ca= used by: java.net.ConnectException: Connection refused
=A0=A0=A0=A0=A0= =A0=A0 at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
=A0=A0=A0=A0=A0=A0=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketC= hannelImpl.java:701)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.net.Sock= etIOWithTimeout.connect(SocketIOWithTimeout.java:206)
=A0=A0=A0=A0=A0=A0= =A0 at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:524)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.net.NetUtils.connect(NetUtils.ja= va:489)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.ipc.Client$Connection= .setupConnection(Client.java:472)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.ha= doop.ipc.Client$Connection.setupIOstreams(Client.java:566)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.ipc.Client$Connection.access$200= 0(Client.java:215)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.hadoop.ipc.Client= .getConnection(Client.java:1271)
=A0=A0=A0=A0=A0=A0=A0 at org.apache.had= oop.ipc.Client.call(Client.java:1141)
=A0=A0=A0=A0=A0=A0=A0 ... 3 more

2012-08-08 15:41:30,493 INFO [main] org.apache.hadoop.metrics2.impl.Me= tricsSystemImpl: Stopping MapTask metrics system...
2012-08-08 15:41:30,= 494 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask = metrics system stopped.
2012-08-08 15:41:30,494 INFO [main] org.apache.hadoop.metrics2.impl.Metrics= SystemImpl: MapTask metrics system shutdown complete.
=A0
--20cf3071d13625b97204c6c7f26e--