Return-Path: X-Original-To: apmail-tez-issues-archive@minotaur.apache.org Delivered-To: apmail-tez-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4D77A172CB for ; Tue, 3 Feb 2015 23:10:34 +0000 (UTC) Received: (qmail 25038 invoked by uid 500); 3 Feb 2015 23:10:35 -0000 Delivered-To: apmail-tez-issues-archive@tez.apache.org Received: (qmail 24980 invoked by uid 500); 3 Feb 2015 23:10:35 -0000 Mailing-List: contact issues-help@tez.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@tez.apache.org Delivered-To: mailing list issues@tez.apache.org Received: (qmail 24969 invoked by uid 99); 3 Feb 2015 23:10:35 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Feb 2015 23:10:35 +0000 Date: Tue, 3 Feb 2015 23:10:35 +0000 (UTC) From: "Chang Li (JIRA)" To: issues@tez.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (TEZ-900) Confusing message for incorrect queue for some tez examples MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/TEZ-900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304269#comment-14304269 ] Chang Li edited comment on TEZ-900 at 2/3/15 11:10 PM: ------------------------------------------------------- [~hitesh] YarnClientImpl's check isn't complete. The "failure of submit to the wrong queue" doesn't really fail to be submitted, instead it succeed in being submitted, however failed immediately after. The check in YarnClientImpl is simply {code} if (!state.equals(YarnApplicationState.NEW) && !state.equals(YarnApplicationState.NEW_SAVING)) { LOG.info("Submitted application " + applicationId); break; } {code} because the state at that time is now FAILED, it will break out the loop and not be captured within YarnClient. But YarnRunner did a more thorough check, it did check FAILED and KILLED state immediate after launch, {code} if (appMaster == null || appMaster.getYarnApplicationState() == YarnApplicationState.FAILED || appMaster.getYarnApplicationState() == YarnApplicationState.KILLED) { throw new IOException("Failed to run job : " + diagnostics); } {code} I can open a jira on yarn for this issue, and I think the solution is simply duplicating the check logic from YarnRunner to YarnClient was (Author: lichangleo): [~hitesh] YarnClientImpl's check isn't complete. The "failure of submit to the wrong queue" doesn't really fail to be submitted, instead it succeed in being submitted, however failed immediately after. The check in YarnClientImpl is simply "if (!state.equals(YarnApplicationState.NEW) && !state.equals(YarnApplicationState.NEW_SAVING)) { LOG.info("Submitted application " + applicationId); break; }" because the state at that time is now FAILED, it will break out the loop and not be captured within YarnClient. But YarnRunner did a more thorough check, it did check FAILED and KILLED state immediate after launch, "if (appMaster == null || appMaster.getYarnApplicationState() == YarnApplicationState.FAILED || appMaster.getYarnApplicationState() == YarnApplicationState.KILLED) { throw new IOException("Failed to run job : " + diagnostics); }" I can open a jira on yarn for this issue, and I think the solution is simply duplicating the check logic from YarnRunner to YarnClient > Confusing message for incorrect queue for some tez examples > ----------------------------------------------------------- > > Key: TEZ-900 > URL: https://issues.apache.org/jira/browse/TEZ-900 > Project: Apache Tez > Issue Type: Bug > Affects Versions: 0.3.0 > Reporter: Jonathan Eagles > Assignee: Chang Li > Attachments: tez900.patch > > > The error message of at least orderedwordcount with non-existent queue is very difficult as compared to mrrsleep which is very intuitive. -- This message was sent by Atlassian JIRA (v6.3.4#6332)