From: Simon Weller <sweller@ena.com>
To: users@cloudstack.apache.org
Subject: Re: Router VM: patchviasocket.py timeout issue on 1 out of 4 networks
Date: Tue, 13 Dec 2016 02:31:41 +0000

Can you turn on agent debug mode and take a look at the debug-level logs?

You can do that by running

sed -i 's/INFO/DEBUG/g' /etc/cloudstack/agent/log4j-cloud.xml

on the host and then restarting the agent.

- Si

________________________________
From: Syahrul Sazli Shaharir
Sent: Monday, December 12, 2016 8:21 PM
To: users@cloudstack.apache.org
Subject: Router VM: patchviasocket.py timeout issue on 1 out of 4 networks

Hi,

I am running the latest CloudStack 4.9.0.1 in a CentOS 7 KVM + ceph environment. After running for some time, I ran into an issue with one out of 4 networks: following a heartbeat-induced reset on all hosts, the associated virtual router would not get recreated and started properly on any of the 3 hosts I have, even after repeated attempts of the following:

- destroy-recreate cycles, via the CloudStack UI
- restartNetwork cleanup=true API calls (failed with errorcode = 530)
- redownloading and reregistering the system VM template as another entry and assigning it to the router VM in global settings (boots the new template OK, but still the same problem)
- tweaking the default system offering for the router VM (increased RAM from 256 to 512MB)
- creating a new system offering, with the RAM tweak and use of the ceph rbd store, and assigning it to Cloud.Com-SoftwareRouter as per the docs - which didn't work for some reason: it kept using the initial default offering and created the image on local host storage
- upgrading to the latest CloudStack (previously was running 4.8)

As with a handful of others in this list's archives, virsh list and dumpxml show the VM was created OK but failed soon after booting, as found in the following errors in agent.log:

2016-12-13 10:03:33,894 WARN [kvm.resource.LibvirtComputingResource] (agentRequest-Handler-1:null) (logid:633e6e03) Timed out: /usr/share/cloudstack-common/scripts/vm/hypervisor/kvm/patchviasocket.py -n r-668-VM -p %template=domP%name=r-668-VM%eth0ip=10.3.28.10%eth0mask=255.255.255.0%gateway=10.3.28.1%domain=nocser.net%cidrsize=24%dhcprange=10.3.28.1%eth1ip=169.254.0.33%eth1mask=255.255.0.0%type=dhcpsrvr%disable_rp_filter=true%dns1=8.8.8.8%dns2=8.8.4.4%ip6dns1=%ip6dns2=%baremetalnotificationsecuritykey=uavJByNGGjNLrELG-qbdN99__1I3tnp8qa0KbcsKokKJcPB43K9s6oQu2nMLqo3YP8p6jqDy5XT3WWOWBA2yNw%baremetalnotificationapikey=8JH4mdkxsEMhgIBgMonkNXAEKjVOeZnG1m5UVekvvo4v_iXQ4ZS7rh6NNS0qphhc7ZrCauiz23tp2-Wa3AASlg%host=10.2.30.11%port=8080 . Output is:
.....
2016-12-13 10:05:45,895 WARN [kvm.resource.LibvirtComputingResource] (agentRequest-Handler-1:null) (logid:633e6e03) Timed out: /usr/share/cloudstack-common/scripts/network/domr/router_proxy.sh vr_cfg.sh 169.254.0.33 -c /var/cache/cloud/VR-48ea8a95-6c02-499f-88d3-eae5bf9f9fbe.cfg . Output is:

As mentioned, this only happens with 1 network (always the same network). The other router VMs work OK. Any clues on how to troubleshoot this further would be greatly appreciated.

Thanks.

--
--sazli
Syahrul Sazli Shaharir
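
When comparing the failing router against the three working ones, it can help to break the string passed to patchviasocket.py with -p into individual settings. The following is only a minimal sketch: it assumes the payload is a plain sequence of %key=value fields, as it appears in the agent.log line above, and the function name parse_boot_args is made up for illustration (it is not part of CloudStack). The sample string is a shortened copy of the logged one.

```python
def parse_boot_args(payload):
    """Split a '%key=value%key=value' string into a dict.

    Assumption: fields are separated by '%' and each field holds one
    'key=value' pair, as seen in the agent.log line. Empty values
    (e.g. 'ip6dns1=') are kept as empty strings.
    """
    args = {}
    for field in payload.split("%"):
        if not field:
            continue  # skip the empty piece before the leading '%'
        key, _, value = field.partition("=")
        args[key] = value
    return args


# Shortened copy of the -p argument from the log line (keys omitted for brevity).
sample = (
    "%template=domP%name=r-668-VM%eth0ip=10.3.28.10%eth0mask=255.255.255.0"
    "%gateway=10.3.28.1%eth1ip=169.254.0.33%eth1mask=255.255.0.0"
    "%type=dhcpsrvr%ip6dns1=%ip6dns2="
)

args = parse_boot_args(sample)
for key in ("name", "eth0ip", "eth1ip", "type"):
    print(key, "=", args[key])
```

Running the same function over the -p string logged for a working router and diffing the two dicts would show whether the failing network is being handed different settings, or whether the arguments match and the timeout lies elsewhere.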