Date: Wed, 22 Oct 2014 05:24:34 +0000 (UTC)
From: "Jeff Zhang (JIRA)"
To: issues@tez.apache.org
Reply-To: dev@tez.apache.org
Subject: [jira] [Commented] (TEZ-1629) Replace ThreadPool's default RejectedExecutionHandler in ContainerLauncherImpl to void abort when AM shutdown

    [ https://issues.apache.org/jira/browse/TEZ-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14179605#comment-14179605 ]

Jeff Zhang commented on TEZ-1629:
---------------------------------

Committed to both master and branch-0.5

> Replace ThreadPool's default RejectedExecutionHandler in ContainerLauncherImpl to void abort when AM shutdown
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: TEZ-1629
>                 URL: https://issues.apache.org/jira/browse/TEZ-1629
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Assignee: Jeff Zhang
>         Attachments: Tez-1629.patch
>
>
> 2014-09-19 10:18:42,654 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.DAGAppMaster: On DAG completion. Old state: RUNNING new state: SUCCEEDED
> 2014-09-19 10:18:42,654 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.DAGAppMaster: Shutting down on completion of dag:dag_1410976766734_0795_1
> 2014-09-19 10:18:42,654 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.DAGAppMaster: DAGAppMasterShutdownHandler invoked
> 2014-09-19 10:18:42,655 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.DAGAppMaster: Handling DAGAppMaster shutdown
> 2014-09-19 10:18:42,656 INFO [AMShutdownThread] org.apache.tez.dag.app.DAGAppMaster: Sleeping for 5 seconds before shutting down
> 2014-09-19 10:18:47,661 INFO [AMShutdownThread] org.apache.tez.dag.app.DAGAppMaster: Calling stop for all the services
> 2014-09-19 10:18:47,662 INFO [AMShutdownThread] org.apache.tez.dag.history.HistoryEventHandler: Stopping HistoryEventHandler
> 2014-09-19 10:18:47,663 INFO [AMShutdownThread] org.apache.tez.dag.history.recovery.RecoveryService: Stopping RecoveryService
> 2014-09-19 10:18:47,663 INFO [AMShutdownThread] org.apache.tez.dag.history.recovery.RecoveryService: Closing Summary Stream
> 2014-09-19 10:18:47,663 INFO [RecoveryEventHandlingThread] org.apache.tez.dag.history.recovery.RecoveryService: EventQueue take interrupted. Returning
> 2014-09-19 10:18:47,699 INFO [AMShutdownThread] org.apache.tez.dag.history.logging.ats.ATSHistoryLoggingService: Stopping ATSService, eventQueueBacklog=0
> 2014-09-19 10:18:47,699 INFO [HistoryEventHandlingThread] org.apache.tez.dag.history.logging.ats.ATSHistoryLoggingService: EventQueue take interrupted. Returning
> 2014-09-19 10:18:47,700 INFO [AMShutdownThread] org.apache.tez.dag.app.launcher.ContainerLauncherImpl: Sending a stop request to the NM for ContainerId: container_1410976766734_0795_01_000002
> 2014-09-19 10:18:47,711 INFO [AsyncDispatcher event handler] org.apache.tez.dag.history.HistoryEventHandler: [HISTORY][DAG:dag_1410976766734_0795_1][Event:CONTAINER_STOPPED]: containerId=container_1410976766734_0795_01_000002, stoppedTime=1411121927711, exitStatus=-100
> 2014-09-19 10:18:47,712 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.rm.container.AMContainerImpl: AMContainer container_1410976766734_0795_01_000002 transitioned from IDLE to STOP_REQUESTED via event C_NM_STOP_SENT
> 2014-09-19 10:18:47,713 INFO [DelayedContainerManager] org.apache.tez.dag.app.rm.YarnTaskSchedulerService: AllocatedContainerManager Thread interrupted
> 2014-09-19 10:18:47,713 ERROR [ContainerLauncher Event Handler] org.apache.hadoop.yarn.YarnUncaughtExceptionHandler: Thread Thread[ContainerLauncher Event Handler,5,main] threw an Exception.
> java.util.concurrent.RejectedExecutionException: Task org.apache.tez.dag.app.launcher.ContainerLauncherImpl$EventProcessor@8b1f182 rejected from java.util.concurrent.ThreadPoolExecutor@5a669243[Terminated, pool size = 0, active threads = 0, queued tasks = 0, completed tasks = 1]
>     at java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:2048)
>     at java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:821)
>     at java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:1372)
>     at org.apache.tez.dag.app.launcher.ContainerLauncherImpl$1.run(ContainerLauncherImpl.java:310)
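
For readers of the archive, here is a minimal sketch of the general technique the issue title describes: giving a ThreadPoolExecutor a RejectedExecutionHandler that does not throw when a task arrives after shutdown. This is not the content of Tez-1629.patch, which is not shown in this message. The class and handler names below are hypothetical, and silently counting dropped tasks is an assumption about the desired behaviour.

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.RejectedExecutionHandler;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration class; not part of Tez.
final class ShutdownAwareRejectionSketch {

    /** Hypothetical handler: drops tasks that are rejected because the pool was shut down. */
    static final class DropOnShutdownHandler implements RejectedExecutionHandler {
        final AtomicInteger dropped = new AtomicInteger();

        @Override
        public void rejectedExecution(Runnable task, ThreadPoolExecutor executor) {
            if (executor.isShutdown()) {
                // Expected during shutdown: record the drop instead of throwing.
                dropped.incrementAndGet();
                return;
            }
            // Any other rejection (for example a saturated pool) is still an error.
            throw new RejectedExecutionException(
                "Task " + task + " rejected from " + executor);
        }
    }

    public static void main(String[] args) throws InterruptedException {
        DropOnShutdownHandler handler = new DropOnShutdownHandler();
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
            1, 1, 0L, TimeUnit.MILLISECONDS,
            new LinkedBlockingQueue<Runnable>(), handler);

        pool.execute(() -> { });            // accepted while the pool is running
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);

        // With the default AbortPolicy this call would throw
        // RejectedExecutionException, as in the stack trace quoted above.
        pool.execute(() -> { });

        System.out.println("dropped=" + handler.dropped.get());
    }
}
```

The handler still throws for rejections that happen while the pool is running, so only the shutdown race seen in the log is suppressed.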