cloudstack-issues mailing list archives

From "Raf Smeets (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (CLOUDSTACK-10377) Nuage VSP regression fails in NetworksWithCleanup test since introduction of fix for CLOUDSTACK-9114 in ACS 4.11&master
Date Wed, 23 May 2018 13:52:00 GMT

     [ https://issues.apache.org/jira/browse/CLOUDSTACK-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Raf Smeets updated CLOUDSTACK-10377:
------------------------------------
    Description: 
Nuage VSP regression fails in NetworksWithCleanup test since introduction of fix for CLOUDSTACK-9114
in ACS 4.11&master

In the Nuage Networks QA regression cycle, cloudstackExpress is failing for master and 4.11
in the expressrestartNetworksWithCleanup test.
 The error is "Unable to create a deployment".
 The issue is caused by the fix for https://issues.apache.org/jira/browse/CLOUDSTACK-9114
  
 The reason is that, for non-redundant routers, a strategy was implemented where a new VR is
deployed first, then the old VR is powered off/destroyed, and the new VR is then re-programmed.
With this strategy, two identical VRs may be up for a brief moment (a few seconds) during which
both can serve traffic; however, the new VR performs an arp-ping on its interfaces to update
neighbours. After the old VR is removed, the new VR is re-programmed, which among many things
performs another arp-ping. The theoretical downtime is therefore limited by the arp-cache
refresh, which can take up to 30 seconds. In my experiments against various VMware, KVM and
XenServer versions, I found that the downtime was indeed less than 30s, usually between 5 and
20 seconds. Compared to older ACS versions, especially in cases where VR deployment requires a
full volume copy (as in VMware), a 10x-12x improvement was seen.

However, the Nuage plugin fails because the IP address is already in use.
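
The ordering problem described above can be illustrated with a minimal sketch. This is not
CloudStack or Nuage VSP code: StrictIpRegistry, IpInUseError and both restart functions are
hypothetical names, and the registry is only an assumed stand-in for a backend that refuses
to hand out an IP address that is still held. The destroy-first variant is likewise an assumed
contrast case, not a documented description of older ACS behaviour.

```python
class IpInUseError(Exception):
    """Raised when an IP is claimed while another owner still holds it."""


class StrictIpRegistry:
    """Hypothetical backend that enforces one owner per IP address."""

    def __init__(self):
        self._owners = {}  # maps ip -> name of the router holding it

    def claim(self, ip, owner):
        if ip in self._owners:
            raise IpInUseError(f"{ip} already in use by {self._owners[ip]}")
        self._owners[ip] = owner

    def release(self, ip, owner):
        # Only the current holder can release the address.
        if self._owners.get(ip) == owner:
            del self._owners[ip]

    def owner_of(self, ip):
        return self._owners.get(ip)


def restart_destroy_first(registry, ip, old_vr, new_vr):
    """Assumed contrast case: remove the old VR before deploying the new one."""
    registry.release(ip, old_vr)
    registry.claim(ip, new_vr)
    return [f"destroy {old_vr}", f"deploy {new_vr}"]


def restart_deploy_first(registry, ip, old_vr, new_vr):
    """Ordering from the description: deploy new VR, destroy old VR, re-program."""
    registry.claim(ip, new_vr)    # raises here: old_vr still holds the IP
    registry.release(ip, old_vr)
    return [f"deploy {new_vr}", f"destroy {old_vr}", f"re-program {new_vr}"]


if __name__ == "__main__":
    registry = StrictIpRegistry()
    registry.claim("10.1.1.1", "vr-old")
    try:
        restart_deploy_first(registry, "10.1.1.1", "vr-old", "vr-new")
    except IpInUseError as err:
        print("deployment failed:", err)
```

In this sketch the deploy-first ordering fails at the first step against a strict backend,
whereas a backend that tolerates two holders for a few seconds would let it proceed.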

  was:
Nuage VSP regression fails in NetworksWithCleanup test since introduction of fix for Since
fix for CLOUDSTACK-9114 in ACS 4.11&master

In Nuage Networks QA regression cycle, cloudstackExpress is failing for master & 4.11
in expressrestartNetworksWithCleanup test.
 The error is Unable to create a deployment.
 Issue is caused by fixing https://issues.apache.org/jira/browse/CLOUDSTACK-9114
 
Reason is that for non-redundant routers, a strategy was implemented where first a new VR
is deployed, then old VR is powered-off/destroyed, and the new VR is again re-programmed.
With this strategy, two identical VRs may be up for a brief moment (few seconds) where both
can serve traffic, however the new VR performs arp-ping on its interfaces to update neighbours.
After the old VR is removed, the new VR is re-programmed which among many things performs
another arpping. The theoretical downtime is therefore limited by the arp-cache refresh which
can be up to 30 seconds. In my experiments, against various VMware, KVM and XenServer versions
I found that the downtime was indeed less than 30s, usually between 5-20 seconds. Compared
to older ACS versions, especially in cases where VRs deployment require full volume copy (like
in VMware) a 10x-12x improvement was seen.BUT NUAGE PLUGIN FAILS AS IP ADDRESS IS ALREADY
IN USE.


> Nuage VSP regression fails in NetworksWithCleanup test since introduction of fix for
CLOUDSTACK-9114 in ACS 4.11&master
> -----------------------------------------------------------------------------------------------------------------------
>
>                 Key: CLOUDSTACK-10377
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-10377
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the default.) 
>    Affects Versions: 4.12, 4.11.1.0
>         Environment: ACS 4.11 or master with Nuage VSP 5.2.x
>            Reporter: Raf Smeets
>            Priority: Major
>



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
