Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 24241 invoked from network); 24 Feb 2010 02:47:41 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 24 Feb 2010 02:47:41 -0000 Received: (qmail 38161 invoked by uid 500); 24 Feb 2010 02:47:39 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 38035 invoked by uid 500); 24 Feb 2010 02:47:39 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 38025 invoked by uid 99); 24 Feb 2010 02:47:38 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Feb 2010 02:47:38 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lvzheng19800619@gmail.com designates 209.85.216.192 as permitted sender) Received: from [209.85.216.192] (HELO mail-px0-f192.google.com) (209.85.216.192) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Feb 2010 02:47:31 +0000 Received: by pxi30 with SMTP id 30so2267414pxi.25 for ; Tue, 23 Feb 2010 18:47:11 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=Jv9WV1Q/wuz6RF1Igvyd0jpILvd0W7qEWt4Hvw/1xD4=; b=L4mIiGkQvTrjPnLzwFwunmS57nWn91Iav+SLShhdpwKqbVEj4fe7F7qEaR1tC7pS/d 06Ssy+IITwBlIABij65fmHv6U8f81mhCi9IeyOj6qxph8MHvB+AWJccr0dUZuk9N0qMU +QQ5QHLvRHOWpxiKJVW52UkiQM54HJacOH0+E= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=LK5eRy2mSRghqltsnQLg94pQPlEiLa1IOQnp7WulR0nPh1CL8//jQyZmkmXmnqjkJU 4jdgoXq8j3tgru//ZS0T9XzZy8JBH1f7l5XDRql8o7pR3BHA8fyHiDRBsUuu8Sr5lrqf /nLoblG+OnZQzYEYLkujt/kMqAJVd8K8AjaQA= MIME-Version: 1.0 Received: by 10.115.27.9 with SMTP id e9mr4245414waj.219.1266979630573; Tue, 23 Feb 2010 18:47:10 -0800 (PST) In-Reply-To: <314098691002220809v1a3f8a32v61b5db4e2a695f5@mail.gmail.com> References: <1bf2d1821002180407n844f844if833f0ea5c754bb8@mail.gmail.com> <17e273101002192016m4a1b238dl8237b5d832563534@mail.gmail.com> <17e273101002200649r2dff83b3jcca82443e7a212b1@mail.gmail.com> <314098691002220809v1a3f8a32v61b5db4e2a695f5@mail.gmail.com> Date: Wed, 24 Feb 2010 10:47:10 +0800 Message-ID: Subject: Re: Many child processes dont exit From: Zheng Lv To: common-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=0016364578b0d167f204804fae87 --0016364578b0d167f204804fae87 Content-Type: text/plain; charset=ISO-8859-1 Thank you Jason, your reply is helpful. 2010/2/23 Jason Venner > Someone is using a threadpool that does not have daemon priority > threads, and that is not shutdown before the main method returns. > The non daemon threads prevent the jvm from exiting. > We had this problem for a while and modified the Child.main to exit, > rather than trying to work out and fix the third party library that > ran the thread pool. This thecnique does of prevent jvm reuse. > > On Sat, Feb 20, 2010 at 6:49 AM, Ted Yu wrote: > > Do you have System.exit() as the last line in your main() ? > > Job job = createSubmittableJob(conf, otherArgs); > > System.exit(job.waitForCompletion(true)? 0 : 1); > > > > > > On Sat, Feb 20, 2010 at 12:32 AM, Zheng Lv >wrote: > > > >> Hello Ted, > >> Yes. Every hour a job will be created and started, and when it > finished, > >> it will maintain. The logs looks like normal, do you know what can lead > to > >> this happen?Thank you. > >> LvZheng > >> 2010/2/20 Ted Yu > >> > >> > Did the number of child processes increase over time ? > >> > > >> > On Friday, February 19, 2010, Zheng Lv > >> wrote: > >> > > Hello Edson, > >> > > Thank you for your reply. I don't want to kill them, I want to > know > >> why > >> > > these child processes don't exit, and to know how to make them exit > >> > > successfully when they finish. Any ideas? Thank you. > >> > > LvZheng > >> > > > >> > > 2010/2/18 Edson Ramiro > >> > > > >> > >> Do you want to kill them ? > >> > >> > >> > >> if yes, you can use > >> > >> > >> > >> ./bin/slaves.sh pkill java > >> > >> > >> > >> but it will kill the datanode and tasktracker processes > >> > >> in all slaves and you'll need to start these processes again. > >> > >> > >> > >> Edson Ramiro > >> > >> > >> > >> > >> > >> On 14 February 2010 22:09, Zheng Lv > >> wrote: > >> > >> > >> > >> > any idea? > >> > >> > > >> > >> > 2010/2/11 Zheng Lv > >> > >> > > >> > >> > > Hello Everyone, > >> > >> > > We often find many child processes in datanodes, which have > >> > already > >> > >> > > finished for long time. And following are the jstack log: > >> > >> > > Full thread dump Java HotSpot(TM) 64-Bit Server VM (14.3-b01 > >> mixed > >> > >> > mode): > >> > >> > > "DestroyJavaVM" prio=10 tid=0x00002aaac8019800 nid=0x2422 > waiting > >> on > >> > >> > > condition [0x0000000000000000] > >> > >> > > java.lang.Thread.State: RUNNABLE > >> > >> > > "NioProcessor-31" prio=10 tid=0x00000000439fa000 nid=0x2826 > >> runnable > >> > >> > > [0x000000004100a000] > >> > >> > > java.lang.Thread.State: RUNNABLE > >> > >> > > at sun.nio.ch.EPollArrayWrapper.epollWait(Native > Method) > >> > >> > > at > >> > >> sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:215) > >> > >> > > at > >> > >> > sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) > >> > >> > > at > >> > >> sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) > >> > >> > > - locked <0x00002aaab9b5f6f8> (a sun.nio.ch.Util$1) > >> > >> > > - locked <0x00002aaab9b5f710> (a > >> > >> > > java.util.Collections$UnmodifiableSet) > >> > >> > > - locked <0x00002aaab9b5f680> (a > >> > sun.nio.ch.EPollSelectorImpl) > >> > >> > > at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > org.apache.mina.transport.socket.nio.NioProcessor.select(NioProcessor.java:65) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > org.apache.mina.common.AbstractPollingIoProcessor$Worker.run(AbstractPollingIoProcessor.java:672) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > org.apache.mina.util.NamePreservingRunnable.run(NamePreservingRunnable.java:51) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) > >> > >> > > at java.lang.Thread.run(Thread.java:619) > >> > >> > > "pool-15-thread-1" prio=10 tid=0x00002aaac802d000 nid=0x2825 > >> waiting > >> > on > >> > >> > > condition [0x0000000041604000] > >> > >> > > java.lang.Thread.State: WAITING (parking) > >> > >> > > at sun.misc.Unsafe.park(Native Method) > >> > >> > > - parking to wait for <0x00002aaab9b61620> (a > >> > >> > > > >> > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject) > >> > >> > > at > >> > >> > > > java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:1925) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:358) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> > >> > > >> > java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947) > >> > >> > > at > >> > >> > > > >> > >> > > >> > >> java.util.concurrent.ThreadPoolExecutor$Worker.run(Threa > >> > > >> > > > > > > -- > Pro Hadoop, a book to guide you from beginner to hadoop mastery, > http://www.amazon.com/dp/1430219424?tag=jewlerymall > www.prohadoopbook.com a community for Hadoop Professionals > --0016364578b0d167f204804fae87--