Return-Path: X-Original-To: apmail-ambari-dev-archive@www.apache.org Delivered-To: apmail-ambari-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A98D3181B8 for ; Tue, 10 Nov 2015 17:30:32 +0000 (UTC) Received: (qmail 13457 invoked by uid 500); 10 Nov 2015 17:30:32 -0000 Delivered-To: apmail-ambari-dev-archive@ambari.apache.org Received: (qmail 13421 invoked by uid 500); 10 Nov 2015 17:30:32 -0000 Mailing-List: contact dev-help@ambari.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ambari.apache.org Delivered-To: mailing list dev@ambari.apache.org Received: (qmail 13405 invoked by uid 99); 10 Nov 2015 17:30:32 -0000 Received: from reviews-vm.apache.org (HELO reviews.apache.org) (140.211.11.40) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Nov 2015 17:30:32 +0000 Received: from reviews.apache.org (localhost [127.0.0.1]) by reviews.apache.org (Postfix) with ESMTP id 1284C27D9A7; Tue, 10 Nov 2015 17:30:32 +0000 (UTC) Content-Type: multipart/alternative; boundary="===============7730277703462258848==" MIME-Version: 1.0 Subject: Re: Review Request 40139: SKIPPED_FAILED state should not be bubbled up to the Upgrade level From: "Dmitro Lisnichenko" To: "Alejandro Fernandez" , "Yusaku Sako" , "Jayush Luniya" , "Nate Cole" , "Jonathan Hurley" Cc: "Ambari" Date: Tue, 10 Nov 2015 17:30:32 -0000 Message-ID: <20151110173032.1681.58687@reviews.apache.org> X-ReviewBoard-URL: https://reviews.apache.org/ Auto-Submitted: auto-generated Sender: "Dmitro Lisnichenko" X-ReviewGroup: Ambari X-Auto-Response-Suppress: DR, RN, OOF, AutoReply X-ReviewRequest-URL: https://reviews.apache.org/r/40139/ X-Sender: "Dmitro Lisnichenko" References: <20151110161229.1681.91539@reviews.apache.org> In-Reply-To: <20151110161229.1681.91539@reviews.apache.org> Reply-To: "Dmitro Lisnichenko" X-ReviewRequest-Repository: ambari --===============7730277703462258848== MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/40139/ ----------------------------------------------------------- (Updated Nov. 10, 2015, 7:30 p.m.) Review request for Ambari, Alejandro Fernandez, Jonathan Hurley, Jayush Luniya, Nate Cole, and Yusaku Sako. Summary (updated) ----------------- SKIPPED_FAILED state should not be bubbled up to the Upgrade level Bugs: AMBARI-13818 https://issues.apache.org/jira/browse/AMBARI-13818 Repository: ambari Description (updated) ------- When there is a skipped failure, the "upgrade" state itself comes SKIPPED_FAILED. Even when the upgrade is running or paused, it is returning "SKIPPED_FAILED". The API should not roll this up to the "upgrade" level as the current behavior is confusing. At the top level, it should just be HOLDING, IN_PROGRESS, COMPLETED, etc. SKIPPED_FAILED should be bubbled up to the upgrade group level and stop there. Also fixes another blocker: STR: 1) Install and deploy cluster with older HDP version 2) Enable NameNode HA 3) Register, install new HDP version 4) Start Rolling Upgrade with "Skip all Service Check failures" and "Skip all Slave Component failures" options 5) Break datanode_upgrade.py script and wait for Core Slaves failures 6) Click "Pause upgrade" on "Core Slaves - >Verifying Skipped Failures" step Result: Button "Resume upgrade" doesn't work. After clicking on this button I've got next http response { "status" : 400, "message" : "java.lang.IllegalArgumentException: Can only set status to PENDING when the upgrade is ABORTED (currently SKIPPED_FAILED)" } Diffs ----- ambari-server/src/main/java/org/apache/ambari/server/controller/internal/CalculatedStatus.java f87c32c Diff: https://reviews.apache.org/r/40139/diff/ Testing (updated) ------- checked on live cluster mvn clean test in progress Thanks, Dmitro Lisnichenko --===============7730277703462258848==--