Date: Fri, 21 Oct 2011 10:52:51 +0200
Subject: Re: jobtracker cannot be started
From: "Alexander C.H. Lorenz"
To: common-user@hadoop.apache.org

It looks like the heap utilization has exceeded the value set by -Xmx. On the jobtracker host, run:

ps waux | grep java | grep -i xmx

The heap size is set in hadoop-env.sh:

export HADOOP_HEAPSIZE=

The default is 1000 (MB), I think.

- alex

On Fri, Oct 21, 2011 at 10:31 AM, Peng, Wei wrote:
> Thank you for your quick reply!!
>
> I cannot change the hadoop conf files because they are owned by a person
> who has left the company, though I have the root access.
My Java version > is java version "1.5.0_07" > Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_07-b03) > Java HotSpot(TM) 64-Bit Server VM (build 1.5.0_07-b03, mixed mode) > > The log on http://jobtracker:50030/stacks is > Process Thread Dump: > 26 active threads > Thread 53 (1424598978@qtp0-5): > State: RUNNABLE > Blocked count: 0 > Waited count: 29 > Stack: > sun.management.ThreadImpl.getThreadInfo0(Native Method) > sun.management.ThreadImpl.getThreadInfo(ThreadImpl.java:147) > sun.management.ThreadImpl.getThreadInfo(ThreadImpl.java:123) > > org.apache.hadoop.util.ReflectionUtils.printThreadInfo(ReflectionUtils.j > ava:149) > > org.apache.hadoop.http.HttpServer$StackServlet.doGet(HttpServer.java:505 > ) > javax.servlet.http.HttpServlet.service(HttpServlet.java:707) > javax.servlet.http.HttpServlet.service(HttpServlet.java:820) > > org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:502) > > org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:363) > > org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:2 > 16) > > org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181) > > org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766) > > org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:417) > > org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandler > Collection.java:230) > > org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) > org.mortbay.jetty.Server.handle(Server.java:324) > > org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:534) > > org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConne > ction.java:864) > org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533) > org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207) > Thread 43 (Trash Emptier): > State: TIMED_WAITING > Blocked count: 0 > Waited count: 183 > Stack: > java.lang.Thread.sleep(Native Method) > 
org.apache.hadoop.fs.Trash$Emptier.run(Trash.java:234) > java.lang.Thread.run(Thread.java:619) > Thread 36 (IPC Server handler 9 on 9000): > State: WAITING > Blocked count: 32 > Waited count: 3444 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 35 (IPC Server handler 8 on 9000): > State: WAITING > Blocked count: 29 > Waited count: 3446 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 34 (IPC Server handler 7 on 9000): > State: WAITING > Blocked count: 30 > Waited count: 3451 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 33 (IPC Server handler 6 on 9000): > State: WAITING > Blocked count: 33 > Waited count: 3445 > Waiting on > 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 32 (IPC Server handler 5 on 9000): > State: WAITING > Blocked count: 29 > Waited count: 3448 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 31 (IPC Server handler 4 on 9000): > State: WAITING > Blocked count: 33 > Waited count: 3446 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 30 (IPC Server handler 3 on 9000): > State: WAITING > Blocked count: 29 > Waited count: 3450 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 29 (IPC Server handler 2 on 9000): > State: WAITING > Blocked count: 29 > Waited count: 3449 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 28 (IPC Server handler 1 on 9000): > State: WAITING > Blocked count: 36 > Waited count: 3445 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > Thread 27 (IPC Server handler 0 on 9000): > State: WAITING > Blocked count: 44 > Waited count: 3445 > Waiting on > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@49 > 59d87f > Stack: > sun.misc.Unsafe.park(Native Method) > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.aw > ait(AbstractQueuedSynchronizer.java:1925) > > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:3 > 58) > org.apache.hadoop.ipc.Server$Handler.run(Server.java:939) > 
Thread 12 (IPC Server listener on 9000): > State: RUNNABLE > Blocked count: 5 > Waited count: 0 > Stack: > sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) > sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:184) > sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) > sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) > sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) > sun.nio.ch.SelectorImpl.select(SelectorImpl.java:84) > org.apache.hadoop.ipc.Server$Listener.run(Server.java:318) > Thread 13 (IPC Server Responder): > State: RUNNABLE > Blocked count: 1 > Waited count: 0 > Stack: > sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) > sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:184) > sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) > sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) > sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) > org.apache.hadoop.ipc.Server$Responder.run(Server.java:478) > Thread 26 (Timer-0): > State: TIMED_WAITING > Blocked count: 0 > Waited count: 190 > Stack: > java.lang.Object.wait(Native Method) > java.util.TimerThread.mainLoop(Timer.java:509) > java.util.TimerThread.run(Timer.java:462) > Thread 25 (1947740113@qtp0-1 - Acceptor0 > SelectChannelConnector@penguin33:50070): > State: RUNNABLE > Blocked count: 579 > Waited count: 5 > Stack: > sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) > sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:184) > sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) > sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) > sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) > > org.mortbay.io.nio.SelectorManager$SelectSet.doSelect(SelectorManager.ja > va:429) > > org.mortbay.io.nio.SelectorManager.doSelect(SelectorManager.java:185) > > org.mortbay.jetty.nio.SelectChannelConnector.accept(SelectChannelConnect > or.java:124) > > 
org.mortbay.jetty.AbstractConnector$Acceptor.run(AbstractConnector.java: > 707) > > org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java > :522) > Thread 22 > (org.apache.hadoop.hdfs.server.namenode.DecommissionManager$Monitor@67c7 > 980c): > State: TIMED_WAITING > Blocked count: 5 > Waited count: 127 > Stack: > java.lang.Thread.sleep(Native Method) > > org.apache.hadoop.hdfs.server.namenode.DecommissionManager$Monitor.run(D > ecommissionManager.java:65) > java.lang.Thread.run(Thread.java:619) > Thread 21 > (org.apache.hadoop.hdfs.server.namenode.FSNamesystem$ReplicationMonitor@ > 2094257f): > State: TIMED_WAITING > Blocked count: 20 > Waited count: 1263 > Stack: > java.lang.Thread.sleep(Native Method) > > org.apache.hadoop.hdfs.server.namenode.FSNamesystem$ReplicationMonitor.r > un(FSNamesystem.java:2304) > java.lang.Thread.run(Thread.java:619) > Thread 20 > (org.apache.hadoop.hdfs.server.namenode.LeaseManager$Monitor@3a51127a): > State: TIMED_WAITING > Blocked count: 21 > Waited count: 1875 > Stack: > java.lang.Thread.sleep(Native Method) > > org.apache.hadoop.hdfs.server.namenode.LeaseManager$Monitor.run(LeaseMan > ager.java:349) > java.lang.Thread.run(Thread.java:619) > Thread 19 > (org.apache.hadoop.hdfs.server.namenode.FSNamesystem$HeartbeatMonitor@61 > 578aab): > State: TIMED_WAITING > Blocked count: 0 > Waited count: 13 > Stack: > java.lang.Thread.sleep(Native Method) > > org.apache.hadoop.hdfs.server.namenode.FSNamesystem$HeartbeatMonitor.run > (FSNamesystem.java:2286) > java.lang.Thread.run(Thread.java:619) > Thread 18 > (org.apache.hadoop.hdfs.server.namenode.PendingReplicationBlocks$Pending > ReplicationMonitor@2339e351): > State: TIMED_WAITING > Blocked count: 0 > Waited count: 13 > Stack: > java.lang.Thread.sleep(Native Method) > > org.apache.hadoop.hdfs.server.namenode.PendingReplicationBlocks$PendingR > eplicationMonitor.run(PendingReplicationBlocks.java:186) > java.lang.Thread.run(Thread.java:619) > Thread 9 (RMI TCP Accept-0): 
> State: RUNNABLE > Blocked count: 0 > Waited count: 0 > Stack: > java.net.PlainSocketImpl.socketAccept(Native Method) > java.net.PlainSocketImpl.accept(PlainSocketImpl.java:384) > java.net.ServerSocket.implAccept(ServerSocket.java:453) > java.net.ServerSocket.accept(ServerSocket.java:421) > > sun.rmi.transport.tcp.TCPTransport$AcceptLoop.executeAcceptLoop(TCPTrans > port.java:369) > > sun.rmi.transport.tcp.TCPTransport$AcceptLoop.run(TCPTransport.java:341) > java.lang.Thread.run(Thread.java:619) > Thread 4 (Signal Dispatcher): > State: RUNNABLE > Blocked count: 0 > Waited count: 0 > Stack: > Thread 3 (Finalizer): > State: WAITING > Blocked count: 0 > Waited count: 40 > Waiting on java.lang.ref.ReferenceQueue$Lock@22f62eba > Stack: > java.lang.Object.wait(Native Method) > java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:116) > java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:132) > java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:159) > Thread 2 (Reference Handler): > State: WAITING > Blocked count: 1 > Waited count: 39 > Waiting on java.lang.ref.Reference$Lock@646d6aa0 > Stack: > java.lang.Object.wait(Native Method) > java.lang.Object.wait(Object.java:485) > java.lang.ref.Reference$ReferenceHandler.run(Reference.java:116) > Thread 1 (main): > State: WAITING > Blocked count: 10 > Waited count: 10 > Waiting on org.apache.hadoop.ipc.RPC$Server@41f6321 > Stack: > java.lang.Object.wait(Native Method) > java.lang.Object.wait(Object.java:485) > org.apache.hadoop.ipc.Server.join(Server.java:1122) > > org.apache.hadoop.hdfs.server.namenode.NameNode.join(NameNode.java:292) > > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:966) > > Wei > > -----Original Message----- > From: Alexander C.H. Lorenz [mailto:wget.null@googlemail.com] > Sent: Friday, October 21, 2011 4:15 AM > To: common-user@hadoop.apache.org > Subject: Re: jobtracker cannot be started > > add into hadoop-env.sh the opts to the jdk-call. 
The logs should be accessible in the Hadoop log directory.
>
> Also check http://jobtracker:50030/stacks - that's the same as jstack
> (jstack PID). You can also use jstack -F PID on the jobtracker host to
> force a thread dump (similar to /stacks, I think).
>
> Are you using a 64-bit JDK? Which version?
>
> regards,
> Alex
>
> On Fri, Oct 21, 2011 at 10:00 AM, Peng, Wei wrote:
>
> > I am using the default heap size, which is 1000MB. The jobtracker hung
> > when I was running only one job. Now I cannot even restart the
> > jobtracker.
> > Can you teach me how to turn on GC logging in hadoop?
> >
> > Thanks!
> > Wei
> >
> > -----Original Message-----
> > From: Alexander C.H. Lorenz [mailto:wget.null@googlemail.com]
> > Sent: Friday, October 21, 2011 3:54 AM
> > To: common-user@hadoop.apache.org
> > Subject: Re: jobtracker cannot be started
> >
> > Hi,
> >
> > what heap size have you given the jobtracker? And how many jobs /
> > users / tasks are running? What does the log say?
> > Turn on GC logging:
> > http://java.sun.com/developer/technicalArticles/Programming/GCPortal/
> >
> > - Alex
> >
> > On Fri, Oct 21, 2011 at 9:47 AM, Peng, Wei wrote:
> >
> > > Hi,
> > >
> > > When I was running a job on hadoop with 75% mappers finished, the
> > > jobtracker hung so that I cannot access
> > > jobtrackerserver:7845/jobtracker.jsp and hadoop job -status hung as
> > > well.
> > >
> > > Then I stopped jobtracker and restarted it. However, the jobtracker
> > > cannot be started.
I received error message from jobtracker.log.out > > > saying > > > > > > > > > > > > Exception in thread "LeaseChecker" java.lang.OutOfMemoryError: Java > > heap > > > space > > > > > > at > > > java.io.BufferedOutputStream.(BufferedOutputStream.java:59) > > > > > > at > > > java.io.BufferedOutputStream.(BufferedOutputStream.java:42) > > > > > > at > > > > > > org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:318) > > > > > > at > > > org.apache.hadoop.ipc.Client$Connection.access$1700(Client.java:176) > > > > > > at > org.apache.hadoop.ipc.Client.getConnection(Client.java:859) > > > > > > at org.apache.hadoop.ipc.Client.call(Client.java:719) > > > > > > at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220) > > > > > > at $Proxy4.renewLease(Unknown Source) > > > > > > at sun.reflect.GeneratedMethodAccessor11.invoke(Unknown > Source) > > > > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor > > > Impl.java:25) > > > > > > at java.lang.reflect.Method.invoke(Method.java:597) > > > > > > at > > > > > > org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvo > > > cationHandler.java:82) > > > > > > at > > > > > > org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocation > > > Handler.java:59) > > > > > > at $Proxy4.renewLease(Unknown Source) > > > > > > at > > > > > > org.apache.hadoop.hdfs.DFSClient$LeaseChecker.renew(DFSClient.java:1016) > > > > > > at > > > > org.apache.hadoop.hdfs.DFSClient$LeaseChecker.run(DFSClient.java:1028) > > > > > > at java.lang.Thread.run(Thread.java:619) > > > > > > Exception in thread "expireTrackers" java.lang.OutOfMemoryError: > Java > > > heap space > > > > > > at java.util.Arrays.copyOf(Arrays.java:2882) > > > > > > at > > > > > > java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.jav > > > a:100) > > > > > > at > > > > java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:390) > > > > > > at 
java.lang.StringBuffer.append(StringBuffer.java:224) > > > > > > at > org.apache.hadoop.mapred.JobHistory.log(JobHistory.java:354) > > > > > > at > > > > > > org.apache.hadoop.mapred.JobHistory$MapAttempt.logStarted(JobHistory.jav > > > a:1354) > > > > > > at > > > > > > org.apache.hadoop.mapred.JobInProgress.failedTask(JobInProgress.java:233 > > > 2) > > > > > > at > > > > > > org.apache.hadoop.mapred.JobInProgress.updateTaskStatus(JobInProgress.ja > > > va:849) > > > > > > at > > > > > > org.apache.hadoop.mapred.JobInProgress.failedTask(JobInProgress.java:246 > > > 3) > > > > > > at > > > > > > org.apache.hadoop.mapred.JobTracker.lostTaskTracker(JobTracker.java:3474 > > > ) > > > > > > at > > > > > > org.apache.hadoop.mapred.JobTracker$ExpireTrackers.run(JobTracker.java:3 > > > 48) > > > > > > at java.lang.Thread.run(Thread.java:619) > > > > > > Exception in thread "IPC Server listener on 9001" > > > java.lang.OutOfMemoryError: Java heap space > > > > > > java.lang.reflect.InvocationTargetException > > > > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > > > > > at > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav > > > a:39) > > > > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor > > > Impl.java:25) > > > > > > at java.lang.reflect.Method.invoke(Method.java:597) > > > > > > at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126) > > > > > > at org.mortbay.log.Log.warn(Log.java:181) > > > > > > at > > > > > > org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:449) > > > > > > at > > > > > > org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:2 > > > 16) > > > > > > at > > > > > > org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181) > > > > > > at > > > > > > org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766) > > > > > > at > > > > 
org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:417) > > > > > > at > > > > > > org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandler > > > Collection.java:230) > > > > > > at > > > > > > org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) > > > > > > at org.mortbay.jetty.Server.handle(Server.java:324) > > > > > > at > > > > > > org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:534) > > > > > > at > > > > > > org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConne > > > ction.java:864) > > > > > > at > org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533) > > > > > > at > > > org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207) > > > > > > at > > > org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403) > > > > > > at > > > > > > org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java: > > > 409) > > > > > > at > > > > > > org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java > > > :522) > > > > > > Caused by: java.lang.OutOfMemoryError: Java heap space > > > > > > java.lang.reflect.InvocationTargetException > > > > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > > > > > at > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav > > > a:39) > > > > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor > > > Impl.java:25) > > > > > > at java.lang.reflect.Method.invoke(Method.java:597) > > > > > > at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126) > > > > > > at org.mortbay.log.Log.warn(Log.java:181) > > > > > > at org.mortbay.thread.Timeout.tick(Timeout.java:159) > > > > > > at > > > > > > org.mortbay.io.nio.SelectorManager$SelectSet.doSelect(SelectorManager.ja > > > va:587) > > > > > > at > > > > org.mortbay.io.nio.SelectorManager.doSelect(SelectorManager.java:185) > > > > > > at > > > > > > 
org.mortbay.jetty.nio.SelectChannelConnector.accept(SelectChannelConnect > > > or.java:124) > > > > > > at > > > > > > org.mortbay.jetty.AbstractConnector$Acceptor.run(AbstractConnector.java: > > > 707) > > > > > > at > > > > > > org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java > > > :522) > > > > > > Caused by: java.lang.OutOfMemoryError: Java heap space > > > > > > java.lang.reflect.InvocationTargetException > > > > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > > > > > at > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav > > > a:39) > > > > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor > > > Impl.java:25) > > > > > > at java.lang.reflect.Method.invoke(Method.java:597) > > > > > > at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126) > > > > > > at org.mortbay.log.Log.warn(Log.java:189) > > > > > > at > > > > > > org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:563) > > > > > > at > > > > > > org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConne > > > ction.java:864) > > > > > > at > org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533) > > > > > > at > > > org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207) > > > > > > at > > > org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403) > > > > > > at > > > > > > org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java: > > > 409) > > > > > > at > > > > > > org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java > > > :522) > > > > > > Caused by: java.lang.OutOfMemoryError: Java heap space > > > > > > Java HotSpot(TM) 64-Bit Server VM warning: Exception > > > java.lang.OutOfMemoryError occurred dispatching signal SIGTERM to > > > handler- the VM may need to be forcibly terminated > > > > > > Exception in thread "retireJobs" java.lang.OutOfMemoryError: Java > heap > > > space > > > > > > 
java.lang.reflect.InvocationTargetException > > > > > > Exception in thread "1586500830@qtp0-37" java.lang.OutOfMemoryError: > > > Java heap space > > > > > > java.lang.reflect.InvocationTargetException > > > > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > > > > > Exception in thread "1513282332@qtp0-29" java.lang.OutOfMemoryError: > > > Java heap space > > > > > > java.lang.reflect.InvocationTargetException > > > > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > > > > > at > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav > > > a:39) > > > > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor > > > Impl.java:25) > > > > > > at java.lang.reflect.Method.invoke(Method.java:597) > > > > > > at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126) > > > > > > at org.mortbay.log.Log.warn(Log.java:181) > > > > > > at > > > > > > org.mortbay.jetty.servlet.HashSessionManager.scavenge(HashSessionManager > > > .java:281) > > > > > > at > > > > > > org.mortbay.jetty.servlet.HashSessionManager.access$000(HashSessionManag > > > er.java:44) > > > > > > at > > > > > > org.mortbay.jetty.servlet.HashSessionManager$2.run(HashSessionManager.ja > > > va:213) > > > > > > at java.util.TimerThread.mainLoop(Timer.java:512) > > > > > > at java.util.TimerThread.run(Timer.java:462) > > > > > > Caused by: java.lang.OutOfMemoryError: Java heap space > > > > > > Exception in thread "IPC Server Responder" > java.lang.OutOfMemoryError: > > > Java heap space > > > > > > java.lang.reflect.InvocationTargetException > > > > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > > > > > > at > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav > > > a:39) > > > > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor > > > Impl.java:25) > > > > > > at 
java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:181)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:429)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > Exception in thread "expireLaunchingTasks" java.lang.OutOfMemoryError: Java heap space
> > >
> > > java.lang.reflect.InvocationTargetException
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > >         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > >         at java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:181)
> > >         at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:449)
> > >         at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
> > >         at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
> > >         at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
> > >         at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:417)
> > >         at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
> > >         at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
> > >         at org.mortbay.jetty.Server.handle(Server.java:324)
> > >         at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:534)
> > >         at org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:864)
> > >         at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533)
> > >         at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207)
> > >         at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:409)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > java.lang.reflect.InvocationTargetException
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > >         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > >         at java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:189)
> > >         at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:563)
> > >         at org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:864)
> > >         at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533)
> > >         at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207)
> > >         at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:409)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > java.lang.reflect.InvocationTargetException
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > >         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > >         at java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:181)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:429)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > java.lang.reflect.InvocationTargetException
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > >         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > >         at java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:181)
> > >         at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:449)
> > >         at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
> > >         at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
> > >         at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
> > >         at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:417)
> > >         at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
> > >         at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
> > >         at org.mortbay.jetty.Server.handle(Server.java:324)
> > >         at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:534)
> > >         at org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:864)
> > >         at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533)
> > >         at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207)
> > >         at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:409)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > java.lang.reflect.InvocationTargetException
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > >         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > >         at java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:181)
> > >         at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:449)
> > >         at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
> > >         at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
> > >         at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
> > >         at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:417)
> > >         at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
> > >         at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
> > >         at org.mortbay.jetty.Server.handle(Server.java:324)
> > >         at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:534)
> > >         at org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:864)
> > >         at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533)
> > >         at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207)
> > >         at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:409)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > java.lang.reflect.InvocationTargetException
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > >         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > >         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > >         at java.lang.reflect.Method.invoke(Method.java:597)
> > >         at org.mortbay.log.Slf4jLog.warn(Slf4jLog.java:126)
> > >         at org.mortbay.log.Log.warn(Log.java:189)
> > >         at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:563)
> > >         at org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:864)
> > >         at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:533)
> > >         at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:207)
> > >         at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:403)
> > >         at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:409)
> > >         at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:522)
> > > Caused by: java.lang.OutOfMemoryError: Java heap space
> > >
> > > I have no idea how to solve this problem. The memory on the jobtracker
> > > server seems ok
> > >
> > >              total       used       free     shared    buffers     cached
> > > Mem:       7148044    3578024    3570020          0     673768     467340
> > > -/+ buffers/cache:    2436916    4711128
> > > Swap:      1020116        160    1019956
> > >
> > > Can someone help me on this?
> > >
> > > Thanks a lot
> > >
> > > Wei
> > >
> >
> > --
> > Alexander Lorenz
> > http://mapredit.blogspot.com
> >
>
> --
> Alexander Lorenz
> http://mapredit.blogspot.com
>

--
Alexander Lorenz
http://mapredit.blogspot.com

--bcaec51dd5cbc29ee504afcb3218--