Return-Path: X-Original-To: apmail-cloudstack-dev-archive@www.apache.org Delivered-To: apmail-cloudstack-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2654910AC9 for ; Sat, 26 Oct 2013 18:09:54 +0000 (UTC) Received: (qmail 73847 invoked by uid 500); 26 Oct 2013 18:09:52 -0000 Delivered-To: apmail-cloudstack-dev-archive@cloudstack.apache.org Received: (qmail 73632 invoked by uid 500); 26 Oct 2013 18:09:52 -0000 Mailing-List: contact dev-help@cloudstack.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cloudstack.apache.org Delivered-To: mailing list dev@cloudstack.apache.org Received: (qmail 73622 invoked by uid 99); 26 Oct 2013 18:09:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 26 Oct 2013 18:09:51 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of msweet.dev@gmail.com designates 209.85.214.181 as permitted sender) Received: from [209.85.214.181] (HELO mail-ob0-f181.google.com) (209.85.214.181) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 26 Oct 2013 18:09:44 +0000 Received: by mail-ob0-f181.google.com with SMTP id wp4so2166286obc.40 for ; Sat, 26 Oct 2013 11:09:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=Hj87MMG/Yy/zznmRerxpJ91EqUO9EauKVDrgS38C9E8=; b=AQmrZTKEdsJMCStOxJcAFbD7eFY4spdEur8sq4ghtvkDX2v7DltoiC9oFFwhF2X3qO SgbbwshGV0tBxg+A3KuzFQcIde8Jo22Xl8pelvkY63Iennfl1PKBa54x8zpCz35dtnKm APBO/m+MewGdVnkZLjSQFcXTrP6iSeZNBXqiSxHM0XBO5kJ0TkpZ/PqxwLkyqYqEscrs tNn16rr5b98dkmms5QkcqSj6JYxw0sjYjSFwME3HvMIT+xTs8CQB2CmW3wyl2ESComaF j+nbDc9RgKRg1vM3BtlSaGdJ/I2wJPN9KrcCxdJwUZIMt2SpgwfBgVyuMf0uF8mReUL5 aPsA== MIME-Version: 1.0 X-Received: by 10.182.81.41 with SMTP id w9mr8096610obx.18.1382810963117; Sat, 26 Oct 2013 11:09:23 -0700 (PDT) Received: by 10.76.88.233 with HTTP; Sat, 26 Oct 2013 11:09:23 -0700 (PDT) In-Reply-To: <5CE50E03-1B4E-45CE-AE09-B2743B9F0388@gmail.com> References: <5CE50E03-1B4E-45CE-AE09-B2743B9F0388@gmail.com> Date: Sat, 26 Oct 2013 19:09:23 +0100 Message-ID: Subject: Re: CS-Management HA Networking From: Marty Sweet To: "dev@cloudstack.apache.org" Content-Type: multipart/alternative; boundary=047d7b2e4d7c3f8c0904e9a8c438 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b2e4d7c3f8c0904e9a8c438 Content-Type: text/plain; charset=ISO-8859-1 Possibly, I would say it makes more sense to find what interface the cluster.node.ip (I think) is using, then get the MAC address from that interface? Then if users add interfaces and change the order through udev it will be persistent (to an extent). The only problem I can think of is if it's using loopback, which it shouldn't do if management setup is run? Marty On Sat, Oct 26, 2013 at 6:08 PM, Darren Shepherd < darren.s.shepherd@gmail.com> wrote: > Glad that helped. Seems that we should change CloudStack to ignore mac > addresses that are 00:..:00. If you want to put in a bug you can assign it > me and I'll look into changing that. > > Darren > > > On Oct 26, 2013, at 5:35 AM, Marty Sweet wrote: > > > > Hi Darren, thanks for the heads up about that script. > > > > Old Networking Setup: > > eth0 eth1 -> management0 > > management0.11 -> vlan11 > > management0.12 -> vlan12 > > > > Turns out in true Ubuntu Networking fashion bond0 was being created for > no > > reason and was appearing in ifconfig -a (so the script was pulling out > the > > first mac address it found), although it was not active and could not be > > downed. > > > > bond0 Link encap:Ethernet HWaddr 00:00:00:00:00:00 > > BROADCAST MASTER MULTICAST MTU:1500 Metric:1 > > RX packets:0 errors:0 dropped:0 overruns:0 frame:0 > > TX packets:0 errors:0 dropped:0 overruns:0 carrier:0 > > collisions:0 txqueuelen:0 > > RX bytes:0 (0.0 B) TX bytes:0 (0.0 B) > > > > Under this configuration the script returned: > > addr in integer is 0 > > addr in bytes is 0 0 0 0 0 0 > > addr in char is 00:00:00:00:00:00 > > > > > > Once I used bond0 as my bond name, opposed to management0, it started > > working, as the bond was now in use. > > Old Networking Setup: > > eth0 eth1 -> bond0 > > bond0.11 -> vlan11 > > bond0.12 -> vlan12 > > > > Many thanks, > > Marty > > > > > >> On Wed, Oct 23, 2013 at 7:44 AM, Marty Sweet > wrote: > >> > >> Hi Darren, > >> > >> Thanks for getting back to me. I will set the networking config up again > >> and run the commands you sent me over the next couple of days. > >> > >> Thanks, > >> Marty > >> > >> > >> On Tue, Oct 22, 2013 at 11:39 PM, Darren Shepherd < > >> darren.s.shepherd@gmail.com> wrote: > >> > >>> Well that wasn't very useful message. If you can find the cloud-utils > >>> jar on your server run > >>> > >>> java -cp /cloud-utils-4.1.1.jar com.cloud.utils.net.MacAddress > >>> > >>> That will output what its finding for the mac address. Also run an > >>> "ifconfig -a" from the command line. If you won't mind sending the > >>> output of "ifconfig -a" that would be helpful to see what's going > >>> wrong. > >>> > >>> Darren > >>> > >>> On Tue, Oct 22, 2013 at 2:48 PM, Marty Sweet > >>> wrote: > >>>> Just noticed I didn't include the log: > >>>> > >>>> http://pastebin.com/wUtCsSAb > >>>> > >>>> Marty > >>>> > >>>> > >>>>> On Tue, Oct 22, 2013 at 10:38 PM, Marty Sweet > >>>> wrote: > >>>> > >>>>> Hi Darren, > >>>>> > >>>>> Maybe I'm getting confused with an issue I had with the Agents around > >>> that > >>>>> time! > >>>>> The error message I got was very cryptic. Having a fresh look at the > >>>>> source code: > >>> > https://github.com/apache/cloudstack/blob/04cdd90a84f4be5ba02778fe0cd352a4b1c39a13/utils/src/org/apache/cloudstack/utils/identity/ManagementServerNode.java > >>>>> > >>>>> Would suggest that it gets: private static final long s_nodeId = > >>>>> MacAddress.getMacAddress().toLong(); and ensures it's <=0 in the > >>> check() > >>>>> function, which is run by the SystemIntegrityChecker. > >>>>> > >>>>> Hopefully it is just a MAC Address issue, what would the > >>> IntegrityChecker > >>>>> be looking for? > >>>>> > >>>>> Thanks, > >>>>> Marty > >>>>> > >>>>> > >>>>> On Tue, Oct 22, 2013 at 10:02 PM, Darren Shepherd < > >>>>> darren.s.shepherd@gmail.com> wrote: > >>>>> > >>>>>> Do you have a specific error from a log? I was not aware that > >>>>>> CloudStack would look for interfaces w/ eth*, em*. In the code it > >>>>>> just does "ifconfig -a" to list the devices. By creating a bond, > the > >>>>>> mac address CloudStack finds will probably change then I could > imagine > >>>>>> something could possibly fail. > >>>>>> > >>>>>> Darren > >>>>>> > >>>>>> On Tue, Oct 22, 2013 at 1:39 PM, Marty Sweet > >>>>>> wrote: > >>>>>>> Hi Guys. > >>>>>>> > >>>>>>> I am planning on upgrading my 4.1.1 infrastructure to 4.2 over the > >>>>>> weekend. > >>>>>>> > >>>>>>> When testing my 4.1.1 setup I ran across a problem where a TOR > >>> switch > >>>>>>> failure would cause an outage to the management server. The agents > >>> use 2 > >>>>>>> NICs for all management traffic using bonds. > >>>>>>> When I tried to configure the management server to use a bond0 in > >>> simple > >>>>>>> active-passive mode (like I use for my agent management network), > >>>>>>> cloudstack-management would not start due to 'Integrity Issues', > >>> which > >>>>>> at > >>>>>>> the time I located back to a IntegitryChecker which ensures the > >>>>>> interfaces > >>>>>>> of eth* em* or some others were taking the IP of management server. > >>>>>>> > >>>>>>> My question is does this limitation still exist and if so, can it > be > >>>>>>> overcome by adding bond* to the list of allowed interface names and > >>>>>>> compiling the management server from source? > >>>>>>> I would love to hear input to this, it seems bizarre to me that it > >>> is > >>>>>>> difficult to add simple but effective network redundancy to the > >>>>>> management > >>>>>>> server. > >>>>>>> > >>>>>>> For scenario basis, this is the basic redundant network setup I > have > >>>>>> for my > >>>>>>> Agents: > >>>>>>> 4x KVM Hosts all with 4 NICs - 2 bonds (Private/Public Traffic) > >>>>>>> > >>>>>>> Example Host: > >>>>>>> ------------------Interconnect--------------- > >>>>>>> TOR 1 --------- TOR 2 > >>>>>>> --------------------- --------------------- > >>>>>>> | Management | > >>>>>>> | Tagged VLANs | > >>>>>>> ---------------------------------------------------- > >>>>>>> KVM Cloudstack Hypervisor > >>>>>>> ---------------------------------------------------- > >>>>>>> | Public Traffic | > >>>>>>> | Tagged VLANS | > >>>>>>> | LACP Aggregation | > >>>>>>> ---------------------------------------------------- > >>>>>>> Core Router > >>>>>>> ---------------------------------------------------- > >>>>>>> > >>>>>>> There are also LACP links with STP rules between the TOR switches > >>> are > >>>>>> the > >>>>>>> core device to allow for interconnect failure so the TORs do not > >>> become > >>>>>>> isolated, but I have excluded that for simplicity. > >>>>>>> > >>>>>>> > >>>>>>> I would have thought it would be easy to create a bond for my > >>> management > >>>>>>> node and connect the two NICs to both the TOR switches, but that > >>> didn't > >>>>>>> work in 4.1.1 due to my reasons above. > >>>>>>> > >>>>>>> Thanks! > >>>>>>> Marty > >> > >> > --047d7b2e4d7c3f8c0904e9a8c438--