Return-Path: X-Original-To: apmail-ambari-issues-archive@minotaur.apache.org Delivered-To: apmail-ambari-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 358E719E4F for ; Mon, 4 Apr 2016 18:31:26 +0000 (UTC) Received: (qmail 74305 invoked by uid 500); 4 Apr 2016 18:31:26 -0000 Delivered-To: apmail-ambari-issues-archive@ambari.apache.org Received: (qmail 74230 invoked by uid 500); 4 Apr 2016 18:31:26 -0000 Mailing-List: contact issues-help@ambari.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ambari.apache.org Delivered-To: mailing list issues@ambari.apache.org Received: (qmail 74178 invoked by uid 99); 4 Apr 2016 18:31:26 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Apr 2016 18:31:26 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id B1A032C1F6E for ; Mon, 4 Apr 2016 18:31:25 +0000 (UTC) Date: Mon, 4 Apr 2016 18:31:25 +0000 (UTC) From: "Myroslav Papirkovskyi (JIRA)" To: issues@ambari.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (AMBARI-15691) Express Upgrade hangs if ambari agent is restarted in the middle of EU MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/AMBARI-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Myroslav Papirkovskyi updated AMBARI-15691: ------------------------------------------- Status: Patch Available (was: Open) > Express Upgrade hangs if ambari agent is restarted in the middle of EU > ---------------------------------------------------------------------- > > Key: AMBARI-15691 > URL: https://issues.apache.org/jira/browse/AMBARI-15691 > Project: Ambari > Issue Type: Bug > Components: ambari-server > Affects Versions: 2.2.2 > Reporter: Myroslav Papirkovskyi > Assignee: Myroslav Papirkovskyi > Priority: Blocker > Fix For: 2.2.2 > > Attachments: AMBARI-15691.patch > > > *Steps* > # Install HDP-2.4.0.0 with Ambari 2.2.2 (secure, non-HA cluster) > # Start EU to 2.4.2.0-127 and reach till "Backup Knox data" prompt > # Hit Proceed at "backup Knox data" message > # Stop ambari agent on two of the cluster hosts and wait for EU to fail with "HOLDING_TIMEDOUT" status (in my test EU stopped at "Snapshot HBase" task) > # Start the agents on both hosts and wait 90 secs. for agents to heartbeat > # Retry the failed task > *Result* > EU hangs > From ambari-server log: > {code} > 04 Apr 2016 08:20:14,729 WARN [ambari-action-scheduler] ActionScheduler:201 - Exception received > java.lang.NullPointerException > at org.apache.ambari.server.actionmanager.ActionScheduler.wasAgentRestartedDuringOperation(ActionScheduler.java:887) > at org.apache.ambari.server.actionmanager.ActionScheduler.processInProgressStage(ActionScheduler.java:691) > at org.apache.ambari.server.actionmanager.ActionScheduler.doWork(ActionScheduler.java:289) > at org.apache.ambari.server.actionmanager.ActionScheduler.run(ActionScheduler.java:196) > at java.lang.Thread.run(Thread.java:745) > 04 Apr 2016 08:30:29,451 WARN [ambari-action-scheduler] ActionScheduler:695 - Detected ambari-agent restart during command execution.The command has been aborted.Execution command details: host: os-d7-ngzvlu-ambari-se-eu-10-2.novalocal, role: ru_execute_tasks, actionId: 19-27 > 04 Apr 2016 08:30:30,581 WARN [ambari-action-scheduler] ActionScheduler:695 - Detected ambari-agent restart during command execution.The command has been aborted.Execution command details: host: os-d7-ngzvlu-ambari-se-eu-10-2.novalocal, role: ru_execute_tasks, actionId: 19-27 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)