From commits-return-89435-archive-asf-public=cust-asf.ponee.io@cloudstack.apache.org Mon Jun 17 12:06:17 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id B5C5018062B for ; Mon, 17 Jun 2019 14:06:16 +0200 (CEST) Received: (qmail 38880 invoked by uid 500); 17 Jun 2019 12:06:17 -0000 Mailing-List: contact commits-help@cloudstack.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cloudstack.apache.org Delivered-To: mailing list commits@cloudstack.apache.org Received: (qmail 38871 invoked by uid 99); 17 Jun 2019 12:06:16 -0000 Received: from ec2-52-202-80-70.compute-1.amazonaws.com (HELO gitbox.apache.org) (52.202.80.70) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 17 Jun 2019 12:06:16 +0000 From: GitBox To: commits@cloudstack.apache.org Subject: [GitHub] [cloudstack] andrijapanicsb opened a new issue #3408: VMware Worker VM doesn't get recycled after a configured timeout Message-ID: <156077317181.12554.9020723732994468692.gitbox@gitbox.apache.org> Date: Mon, 17 Jun 2019 12:06:11 -0000 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit andrijapanicsb opened a new issue #3408: VMware Worker VM doesn't get recycled after a configured timeout URL: https://github.com/apache/cloudstack/issues/3408 ##### ISSUE TYPE * Bug Report * Improvement Request * Enhancement Request * Feature Idea * Documentation Report * Other ##### COMPONENT NAME ~~~ API ~~~ ##### CLOUDSTACK VERSION ~~~ 4.11.2 (didn't test previous ones) ~~~ ##### CONFIGURATION VMware 6.5u2 (should be irelevant) ##### OS / ENVIRONMENT mgmt = CentOS 7 (irelevant) ##### SUMMARY Temp VMware worker VM used to export volume snapshot is not being recycled (i.e. removed) after the timeout is reached (and when configured to be removed( ##### STEPS TO REPRODUCE S~~~ Set following global settings: - job.expire.minutes : 1 - job.cancel.threshold.minutes : 1 vmware.clean.old.worker.vms : true Create a volume snapshot. After 2 x ( job.expire.minutes + job.cancel.threshold.minutes) = 240 sec, the job will time out / fail with the following message in logs: 2019-06-17 11:35:26,459 INFO [c.c.h.v.m.VmwareManagerImpl] (DirectAgentCronJob-40:ctx-dbfaf1a2) (logid:bff5a57c) *Worker VM expired, seconds elapsed: 252* 2019-06-17 11:35:26,463 INFO [c.c.h.v.r.VmwareResource] (DirectAgentCronJob-40:ctx-dbfaf1a2) (logid:bff5a57c) *Recycle pending worker VM: 935686f0ee0b4b518ca2b50597650c75* but tasks remain on the vCenter side: ![worker-vm-tasks-not-removed](https://user-images.githubusercontent.com/45762285/59602942-58f92680-9108-11e9-9b0a-624de4020cf4.JPG) The way to "clean it manually" is to stop/kill the OVF export task in vCenter, after which the VM reconfiguration task and finally removal task (not visible on the image) will be executed and worker VM will be removed as per the expected behavior. Issues seems, that we are NOT killing to OVF export task once the timeouts are reached in ACS, which is the required step. ~~~ ##### EXPECTED RESULTS ~~~ OVF task is stopped/killed and worker VM get's removed. ~~~ ##### ACTUAL RESULTS ~~~ Worker VM keeps running, as well as the OVF export task. ~~~ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org With regards, Apache Git Services