Date: Fri, 16 Dec 2016 11:46:11 +0800
From: Syahrul Sazli Shaharir
To: dev@cloudstack.apache.org
Subject: Re: patchviasocket seems to be broken with qemu 2.3(+?)
Organization: ModernOne Data Solutions Sdn. Bhd.
Message-ID: <1c8a1e1372123511c68af124ae40bc1a@nocser.net>

On 2016-12-16 11:27, Syahrul Sazli Shaharir wrote:
> On Wed, 26 Oct 2016, Linas Žilinskas wrote:
>
>> So after some investigation I've found out that qemu 2.3.0 is indeed
>> broken, at least the way CS uses the qemu chardev/socket.
>>
>> Not sure in which specific version it happened, but it was fixed in
>> 2.4.0-rc3, specifically noting that CloudStack 4.2 was not working.
>>
>> qemu git commit: 4bf1cb03fbc43b0055af60d4ff093d6894aa4338
>>
>> Also attaching the patch from that commit.
>>
>>
>> For our own purposes i've included the patch to the qemu-kvm-ev
>> package (2.3.0) and all is well.
>
> Hi,
>
> I am facing the exact same issue on latest Cloudstack 4.9.0.1, on
> latest CentOS 7.3.1611, with latest qemu-kvm-ev-2.6.0-27.1.el7
> package.
>
> The issue initially surfaced following a heartbeat-induced reset of
> all hosts, when it was on CS 4.8 @ CentOS 7.0 and stock
> qemu-kvm-1.5.3. Since then, the patchviasocket.pl/py timeouts
> persisted for 1 out of 4 router VM/networks, even after upgrading to
> latest code. (I have checked the qemu-kvm-ev-2.6.0-27.1.el7 source,
> and the patched code are pretty much still intact, as per the
> 2.4.0-rc3 commit).
>
> Any help would be greatly appreciated.
>
> Thanks.
>
> (Attached are some debug logs from the host's agent.log)

Here are the debug logs as mentioned:

http://pastebin.com/yHdsMNzZ

Thanks.

>
> --sazli
>
>>
>>
>> On 2016-10-20 09:59, Linas Žilinskas wrote:
>>>
>>> Hi.
>>>
>>> We have made an upgrade to 4.9.
>>>
>>> Custom build packages with our own patches, which in my mind (i'm
>>> the only one patching those) should not affect the issue i'll
>>> describe.
>>>
>>> I'm not sure whether we didn't notice it before, or it's actually
>>> related to something in 4.9
>>>
>>> Basically our system vm's were unable to be patched via the qemu
>>> socket. The script simply error'ed out with a timeout while trying
>>> to push the data to the socket.
>>>
>>> Executing it manually (with cmd line from the logs) resulted the
>>> same. I even tried the old perl variant, which also had same result.
>>>
>>> So finally we found out that this issue happens only on our HVs
>>> which run qemu 2.3.0, from the centos 7 special interest
>>> virtualization repo. Other ones that run qemu 1.5, from official
>>> repos, can patch the system vms fine.
>>>
>>> So i'm wondering if anyone tested 4.9 with kvm with qemu >= 2.x?
>>> Maybe it something else special in our setup. e.g. we're running
>>> the HVs from a preconfigured netboot image (pxe), but all of them,
>>> including those with qemu 1.5, so i have no idea.
>>>
>>>
>>> Linas Žilinskas
>>> Head of Development
>>>
>>> Host1Plus is a division of Digital Energy Technologies Ltd.
>>>
>>> 26 York Street, London W1U 6PZ, United Kingdom
>>>
>>
>> Linas Žilinskas
>> Head of Development
>>
>> Host1Plus is a division of Digital Energy Technologies Ltd.
>>
>> 26 York Street, London W1U 6PZ, United Kingdom
>>
>>
>>

-- 
--sazli
[ HP | Dell | Microsoft | Symantec | Server & Network Infrastructure ]
W : www.modern.com.my
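
P.S. For anyone who wants to reproduce the timeout outside the agent, below is a minimal Python sketch of the general pattern described in the thread: connect to a UNIX stream socket, push some bytes, and give up after a timeout. It is not the actual patchviasocket script. The function name `push_to_socket`, the socket path and the payload are hypothetical, and the real script's payload format is not shown here.

```python
import socket


def push_to_socket(path, payload, timeout=5.0):
    """Send payload to a UNIX stream socket.

    Returns True if all bytes were sent, False if the connect or the
    send timed out. 'path' and 'payload' are placeholders (assumptions),
    not the values the CloudStack agent uses.
    """
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    sock.settimeout(timeout)  # applies to connect() and sendall()
    try:
        sock.connect(path)
        sock.sendall(payload)
        return True
    except socket.timeout:
        # The peer accepted nothing (or stopped reading) within 'timeout'.
        return False
    finally:
        sock.close()
```

Calling it as `push_to_socket("/path/to/vm.agent", b"test")` with the path of the VM's chardev socket (substitute your own) should show whether the other end is reading at all: a False return means the data was not consumed within the timeout.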