Date: Tue, 6 Sep 2016 20:46:21 +0000 (UTC)
From: "Hitesh Shah (JIRA)"
To: issues@tez.apache.org
Reply-To: dev@tez.apache.org
Subject: [jira] [Commented] (TEZ-3426) Second AM attempt launched for session mode and recovery disabled for certain cases

    [ https://issues.apache.org/jira/browse/TEZ-3426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15468513#comment-15468513 ]

Hitesh Shah commented on TEZ-3426:
----------------------------------

+1

> Second AM attempt launched for session mode and recovery disabled for certain cases
> -----------------------------------------------------------------------------------
>
>                 Key: TEZ-3426
>                 URL: https://issues.apache.org/jira/browse/TEZ-3426
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Jonathan Eagles
>            Assignee: Jason Lowe
>            Priority: Critical
>         Attachments: TEZ-3426.001.patch, TEZ-3426.002.patch, TEZ-3426.003.patch, TEZ-3426.004.patch
>
>
> ApplicationSubmissionContext#setMaxAppAttempts does not strictly guarantee that there will be at most that many attempts. A few exceptional cases are not counted toward the limit. Tez should protect itself from accidentally starting a second attempt in session mode and when recovery is disabled, since the second attempt will always succeed with no work to do.
> {code}
>   @Override
>   public boolean shouldCountTowardsMaxAttemptRetry() {
>     try {
>       this.readLock.lock();
>       int exitStatus = getAMContainerExitStatus();
>       return !(exitStatus == ContainerExitStatus.PREEMPTED
>           || exitStatus == ContainerExitStatus.ABORTED
>           || exitStatus == ContainerExitStatus.DISKS_FAILED
>           || exitStatus == ContainerExitStatus.KILLED_BY_RESOURCEMANAGER);
>     } finally {
>       this.readLock.unlock();
>     }
>   }
> {code}
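
To make the failure mode concrete, here is a minimal, self-contained sketch in plain Java (no Hadoop or Tez dependencies). It is not the attached patch. The AmExit enum and the SecondAttemptGuard class with its methods are hypothetical names introduced for illustration. The fail-fast condition (attempt number above 1, session mode on, recovery off) is one reading of the issue title and is an assumption; the actual fix may use a different condition.

```java
import java.util.EnumSet;
import java.util.Set;

/** Hypothetical stand-in for the YARN exit statuses named in the quoted snippet. */
enum AmExit { PREEMPTED, ABORTED, DISKS_FAILED, KILLED_BY_RESOURCEMANAGER, FAILED }

/** Illustrative model only; names and policy are assumptions, not Tez or YARN API. */
final class SecondAttemptGuard {

    // Exit statuses that the quoted check excludes from the max-attempt count.
    private static final Set<AmExit> NOT_COUNTED = EnumSet.of(
            AmExit.PREEMPTED,
            AmExit.ABORTED,
            AmExit.DISKS_FAILED,
            AmExit.KILLED_BY_RESOURCEMANAGER);

    private SecondAttemptGuard() {
    }

    /** Mirrors the quoted logic: does this AM exit count toward the limit? */
    static boolean countsTowardMaxAttempts(AmExit exit) {
        return !NOT_COUNTED.contains(exit);
    }

    /**
     * Simplified model of the relaunch decision: another attempt is started
     * while the number of counted failures is below the configured maximum.
     */
    static boolean wouldRelaunch(int maxAppAttempts, int countedFailures) {
        return countedFailures < maxAppAttempts;
    }

    /**
     * Assumed guard: refuse to run any attempt after the first when the AM is
     * in session mode and recovery is disabled, because such an attempt has
     * no work to recover and would otherwise report success.
     */
    static boolean shouldFailFast(int attemptNumber, boolean sessionMode, boolean recoveryEnabled) {
        if (attemptNumber < 1) {
            throw new IllegalArgumentException("attemptNumber must be >= 1");
        }
        return attemptNumber > 1 && sessionMode && !recoveryEnabled;
    }
}
```

In this model, with a maximum of one attempt, a first attempt that exits as PREEMPTED leaves the counted failures at zero, so wouldRelaunch(1, 0) is true and a second attempt starts. The guard is what lets that second attempt fail fast instead of finishing successfully with nothing to do.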