Return-Path: X-Original-To: apmail-giraph-user-archive@www.apache.org Delivered-To: apmail-giraph-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2E08D11B6B for ; Fri, 16 May 2014 19:25:11 +0000 (UTC) Received: (qmail 92801 invoked by uid 500); 16 May 2014 11:49:55 -0000 Delivered-To: apmail-giraph-user-archive@giraph.apache.org Received: (qmail 85847 invoked by uid 500); 16 May 2014 11:38:49 -0000 Mailing-List: contact user-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@giraph.apache.org Delivered-To: mailing list user@giraph.apache.org Received: (qmail 50578 invoked by uid 99); 16 May 2014 11:15:23 -0000 Received: from Unknown (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 May 2014 11:15:23 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of togarun@gmail.com designates 209.85.128.173 as permitted sender) Received: from [209.85.128.173] (HELO mail-ve0-f173.google.com) (209.85.128.173) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 May 2014 08:09:38 +0000 Received: by mail-ve0-f173.google.com with SMTP id pa12so823976veb.4 for ; Thu, 15 May 2014 01:09:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=8IQi690zH5d1hNCWVE4aDb0wJdxBdjpnmBNYsXZnndU=; b=J85x5+d63UjUbhpNc6ulHYP6cxDIEYeWtJuNNGGXjXxH4E3Sy1eY2A/vD58i2/e9nW PFNWxafr9b8MjWxOCd3aqMKEuXTktrUiVPyhGJBLIbcaWli1or5wy5ZHemrlm3RTGMnj WoiS/56dan8aMny4gKJdWbco5pA+WFL2dlDdRpJPcQrdW6UHzO5bwc7YS5mqSYa25gy/ 7FIORO0gXXXHm9RLRRWrY1bv8Qlo17J1d5tTgBYRHbUIsgxOuqY+2Qgm78Jb38GmRBva Vd2ZdpGykVnq/i+YfcYlSnT5oJXRCllsRTnxs4QEbIhU57qu8WvtCymt9JiFjegtwc+0 JOiQ== MIME-Version: 1.0 X-Received: by 10.58.186.207 with SMTP id fm15mr7320379vec.4.1400141354453; Thu, 15 May 2014 01:09:14 -0700 (PDT) Received: by 10.58.226.10 with HTTP; Thu, 15 May 2014 01:09:14 -0700 (PDT) In-Reply-To: <5373C06F.5020400@apache.org> References: <5373C06F.5020400@apache.org> Date: Thu, 15 May 2014 13:39:14 +0530 Message-ID: Subject: Re: Error while executing large graph From: Arun Kumar To: user@giraph.apache.org Content-Type: multipart/alternative; boundary=047d7b67594c1150cf04f96bd07e X-Virus-Checked: Checked by ClamAV on apache.org --047d7b67594c1150cf04f96bd07e Content-Type: text/plain; charset=UTF-8 Hi Thanks for the replay . I am running this example in a cluster of 5 machines each machine is having 16 GB of ram.The java heap size is set as 2000mb and java.child.options is set with 2000mb and each machine has 4 cores and total number of map instance is set as 3. So for each slave machine 10 gb will be used. My input data is of 1gb size In this scenario how can out of memory error occur .Please clarify Regards Arun On Thu, May 15, 2014 at 12:43 AM, Avery Ching wrote: > I think this is the key message. > > > 0 out of 196 partitions computed; min free memory on worker 6 - 0.81MB, > average 11.56MB > > Having less than 1 MB free won't work. Your workers are likely OOM, > killing the job. Can you get more memory for your job? > > > On 5/14/14, 3:13 AM, Arun Kumar wrote: > > Hi when i run giraph job against a data of 1 gb i am getting the below > exception after some times can somebody tell me what is the issue? > 14/05/14 01:54:01 INFO job.JobProgressTracker: Data from 14 workers - > Compute superstep 2: 0 out of 4847571 vertices computed; 0 out of 196 > partitions computed; min free memory on worker 6 - 0.81MB, average 11.56MB > 14/05/14 01:54:03 INFO zookeeper.ClientCnxn: Unable to read additional > data from server sessionid 0x145f9cff031000f, likely server has closed > socket, closing socket connection and attempting reconnect > 14/05/14 01:54:04 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:04 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:06 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:06 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:06 WARN zk.ZooKeeperExt: exists: Connection loss on attempt > 0, waiting 5000 msecs before retrying. > org.apache.zookeeper.KeeperException$ConnectionLossException: > KeeperErrorCode = ConnectionLoss for > /_hadoopBsp/job_201405140108_0003/_workerProgresses > at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) > at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069) > at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360) > at > org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87) > at java.lang.Thread.run(Thread.java:745) > 14/05/14 01:54:08 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:08 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:09 INFO mapred.JobClient: map 93% reduce 0% > 14/05/14 01:54:10 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:10 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:12 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:12 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:12 WARN zk.ZooKeeperExt: exists: Connection loss on attempt > 1, waiting 5000 msecs before retrying. > org.apache.zookeeper.KeeperException$ConnectionLossException: > KeeperErrorCode = ConnectionLoss for > /_hadoopBsp/job_201405140108_0003/_workerProgresses > at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) > at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069) > at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360) > at > org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87) > at java.lang.Thread.run(Thread.java:745) > 14/05/14 01:54:13 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:13 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:15 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:15 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:16 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:16 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:18 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:18 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:18 WARN zk.ZooKeeperExt: exists: Connection loss on attempt > 2, waiting 5000 msecs before retrying. > org.apache.zookeeper.KeeperException$ConnectionLossException: > KeeperErrorCode = ConnectionLoss for > /_hadoopBsp/job_201405140108_0003/_workerProgresses > at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) > at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041) > at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069) > at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360) > at > org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87) > at java.lang.Thread.run(Thread.java:745) > 14/05/14 01:54:20 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:20 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:21 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:21 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:22 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:22 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:23 INFO job.JobProgressTracker: run: Exception occurred > java.lang.IllegalStateException: exists: Failed to check > /_hadoopBsp/job_201405140108_0003/_workerProgresses after 3 tries! > at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:369) > at > org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87) > at java.lang.Thread.run(Thread.java:745) > 14/05/14 01:54:24 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:24 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:24 WARN zk.ZooKeeperExt: createExt: Connection loss on > attempt 0, waiting 5000 msecs before retrying. > org.apache.zookeeper.KeeperException$ConnectionLossException: > KeeperErrorCode = ConnectionLoss for > /_hadoopBsp/job_201405140108_0003/_cleanedUpDir/client > at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) > at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) > at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783) > at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152) > at > org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:123) > at java.lang.Thread.run(Thread.java:745) > 14/05/14 01:54:25 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:25 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:27 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:27 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:29 INFO mapred.JobClient: map 86% reduce 0% > 14/05/14 01:54:30 INFO zookeeper.ClientCnxn: Opening socket connection to > server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to > authenticate using SASL (unknown error) > 14/05/14 01:54:30 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for > server null, unexpected error, closing socket connection and attempting > reconnect > java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739) > at > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > 14/05/14 01:54:30 WARN zk.ZooKeeperExt: createExt: Connection loss on > attempt 1, waiting 5000 msecs before retrying. > org.apache.zookeeper.KeeperException$ConnectionLossException: > KeeperErrorCode = ConnectionLoss for > /_hadoopBsp/job_201405140108_0003/_cleanedUpDir/client > at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) > at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) > at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783) > at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152) > at > org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:123) > at java.lang.Thread.run(Thread.java:745) > 14/05/14 01:54:30 INFO mapred.JobClient: Job complete: > job_201405140108_0003 > 14/05/14 01:54:30 INFO mapred.JobClient: Counters: 6 > 14/05/14 01:54:30 INFO mapred.JobClient: Job Counters > 14/05/14 01:54:30 INFO mapred.JobClient: SLOTS_MILLIS_MAPS=30036780 > 14/05/14 01:54:30 INFO mapred.JobClient: Total time spent by all > reduces waiting after reserving slots (ms)=0 > 14/05/14 01:54:30 INFO mapred.JobClient: Total time spent by all maps > waiting after reserving slots (ms)=0 > 14/05/14 01:54:30 INFO mapred.JobClient: Launched map tasks=15 > 14/05/14 01:54:30 INFO mapred.JobClient: SLOTS_MILLIS_REDUCES=0 > 14/05/14 01:54:30 INFO mapred.JobClient: Failed map tasks=1 > > Regards > Arun > > > --047d7b67594c1150cf04f96bd07e Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi
Thanks for the replay .

=
I am running this example in a cluster of 5 machines each machine is = having 16 GB of ram.The java=C2=A0 heap size is set as 2000mb and java.chil= d.options is set with 2000mb and each machine has 4 cores and total number = of map instance is set as 3.
So for each slave machine 10 gb will be used.

= My input data is of 1gb size
In this scenario how can out of memory err= or occur=C2=A0 .Please clarify

Regards
Arun=




On Thu, May 15, 2014 at 12:43 AM, Avery Ching <aching@= apache.org> wrote:
=20 =20 =20
I think this is the key message.


0 out of 196 partitions computed; min free memory on worker 6 - 0.81MB, average 11.56MB

Having less than 1 MB free won't work.=C2=A0 Your workers are lik= ely OOM, killing the job.=C2=A0 Can you get more memory for your job?


On 5/14/14, 3:13 AM, Arun Kumar wrote:
Hi when i run giraph job against a data of 1 gb i am getting the below exception after some times can somebody tell me what is the issue?
14/05/14 01:54:01 INFO job.JobProgressTracker: Data from 14 workers - Compute superstep 2: 0 out of 4847571 vertices computed; 0 out of 196 partitions computed; min free memory on worker 6 - 0.81MB, average 11.56MB
14/05/14 01:54:03 INFO zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x145f9cff031000f, likely server has closed socket, closing socket connection and attempting reconnect
14/05/14 01:54:04 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:04 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:06 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:06 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:06 WARN zk.ZooKeeperExt: exists: Connection loss on attempt 0, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode =3D ConnectionLoss for /_hadoopBsp/job_201405140108_0003/_workerProgresses
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:99)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:51)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360)=
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87)<= br> =C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745)
14/05/14 01:54:08 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:08 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:09 INFO mapred.JobClient:=C2=A0 map 93% reduce 0= %
14/05/14 01:54:10 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:10 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:12 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:12 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:12 WARN zk.ZooKeeperExt: exists: Connection loss on attempt 1, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode =3D ConnectionLoss for /_hadoopBsp/job_201405140108_0003/_workerProgresses
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:99)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:51)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360)=
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87)<= br> =C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745)
14/05/14 01:54:13 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:13 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:15 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:15 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:16 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:16 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:18 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:18 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:18 WARN zk.ZooKeeperExt: exists: Connection loss on attempt 2, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode =3D ConnectionLoss for /_hadoopBsp/job_201405140108_0003/_workerProgresses
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:99)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:51)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360)=
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87)<= br> =C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745)
14/05/14 01:54:20 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:20 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:21 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:21 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:22 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:22 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:23 INFO job.JobProgressTracker: run: Exception occurred
java.lang.IllegalStateException: exists: Failed to check /_hadoopBsp/job_201405140108_0003/_workerProgresses after 3 tries!
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:369)=
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87)<= br> =C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745)
14/05/14 01:54:24 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:24 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:24 WARN zk.ZooKeeperExt: createExt: Connection loss on attempt 0, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode =3D ConnectionLoss for /_hadoopBsp/job_201405140108_0003/_cleanedUpDir/client
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:99)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:51)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:1= 52)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:123)=
=C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745)
14/05/14 01:54:25 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:25 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:27 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:27 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:29 INFO mapred.JobClient:=C2=A0 map 86% reduce 0= %
14/05/14 01:54:30 INFO zookeeper.ClientCnxn: Opening socket connection to server mercado-12.hpl.hp.com/15.25.119.147:22= 181. Will not attempt to authenticate using SASL (unknown error)
14/05/14 01:54:30 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.checkConnect= (Native Method)
=C2=A0=C2=A0=C2=A0 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.ja= va:739)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.ja= va:350)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:= 1068)
14/05/14 01:54:30 WARN zk.ZooKeeperExt: createExt: Connection loss on attempt 1, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode =3D ConnectionLoss for /_hadoopBsp/job_201405140108_0003/_cleanedUpDir/client
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:99)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.KeeperException.create(KeeperException.jav= a:51)
=C2=A0=C2=A0=C2=A0 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:1= 52)
=C2=A0=C2=A0=C2=A0 at org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:123)=
=C2=A0=C2=A0=C2=A0 at java.lang.Thread.run(Thread.java:745)
14/05/14 01:54:30 INFO mapred.JobClient: Job complete: job_201405140108_0003
14/05/14 01:54:30 INFO mapred.JobClient: Counters: 6
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0 Job Counte= rs
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0=C2=A0=C2= =A0 SLOTS_MILLIS_MAPS=3D30036780
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0=C2=A0=C2= =A0 Total time spent by all reduces waiting after reserving slots (ms)=3D0
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0=C2=A0=C2= =A0 Total time spent by all maps waiting after reserving slots (ms)=3D0
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0=C2=A0=C2= =A0 Launched map tasks=3D15
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0=C2=A0=C2= =A0 SLOTS_MILLIS_REDUCES=3D0
14/05/14 01:54:30 INFO mapred.JobClient:=C2=A0=C2=A0=C2=A0=C2= =A0 Failed map tasks=3D1

Regards
Arun



--047d7b67594c1150cf04f96bd07e--