cloudstack-issues mailing list archives

From "Murali Reddy (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CLOUDSTACK-3998) explore simulator based fault injection for resiliency testing
Date Thu, 01 Aug 2013 22:37:49 GMT

    [ https://issues.apache.org/jira/browse/CLOUDSTACK-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13727008#comment-13727008 ]

Murali Reddy commented on CLOUDSTACK-3998:
------------------------------------------

Reference for code-injection-based faults:
http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/FaultInjectFramework.html
                
> explore simulator based fault injection for resiliency testing 
> ---------------------------------------------------------------
>
>                 Key: CLOUDSTACK-3998
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3998
>             Project: CloudStack
>          Issue Type: Task
>      Security Level: Public(Anyone can view this level - this is the default.) 
>          Components: Management Server
>            Reporter: Murali Reddy
>            Assignee: Murali Reddy
>             Fix For: Future
>
>
> We could inject controlled faults into simulated hypervisors, network elements, storage
> resources, and system VMs, and run tests that exercise the resiliency of the CloudStack core.
> For example, we could have a test case (one that runs only against the Simulator) in which we
> instruct a simulated hypervisor resource not to respond to pings from CloudStack. The expected
> result would be for the core to treat the hypervisor host as disconnected and trigger HA for
> HA-enabled VMs.
> My initial thinking is to expose a set of test APIs from the simulator plug-in that would help
> inject transient, permanent, or intermittent faults into simulated resources, such as:
>  - a host that has lost network connectivity with the management server (MS)
>  - delayed responses to agent commands (simulating overloaded hypervisor stacks like XAPI or
>    vCenter) and long-running tasks like snapshots
>  - non-responding edge service VMs
> We need to think a little more about the failure categories and failure points, and how best to
> abstract them into a test API that enables flexible and rich resilience tests.
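
To make the idea above concrete, here is a minimal, self-contained Python sketch of what such a
fault-injection test API could look like. It is not CloudStack or simulator plug-in code: the
names FaultMode, Fault, SimulatedHost, inject_fault, clear_fault, and host_state are all
hypothetical, and the "three missed pings means disconnected" rule is an assumption made purely
for illustration. Delayed responses are left out to keep the sketch short.

```python
import random
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Optional


class FaultMode(Enum):
    TRANSIENT = "transient"        # fail the next N commands, then recover
    PERMANENT = "permanent"        # fail until explicitly cleared
    INTERMITTENT = "intermittent"  # fail each command with some probability


@dataclass
class Fault:
    mode: FaultMode
    remaining: int = 0        # TRANSIENT only: commands still to fail
    probability: float = 0.0  # INTERMITTENT only: failure chance per command


class SimulatedHost:
    """Hypothetical stand-in for a simulated hypervisor resource."""

    def __init__(self, name: str, rng: Optional[random.Random] = None):
        self.name = name
        self._faults: Dict[str, Fault] = {}
        self._rng = rng if rng is not None else random.Random()

    def inject_fault(self, command: str, fault: Fault) -> None:
        """Attach a fault to one command type, e.g. "ping"."""
        self._faults[command] = fault

    def clear_fault(self, command: str) -> None:
        self._faults.pop(command, None)

    def handle(self, command: str) -> Optional[str]:
        """Return "ok", or None when an injected fault suppresses the answer."""
        fault = self._faults.get(command)
        if fault is None:
            return "ok"
        if fault.mode is FaultMode.PERMANENT:
            return None
        if fault.mode is FaultMode.TRANSIENT:
            if fault.remaining > 0:
                fault.remaining -= 1
                return None
            del self._faults[command]  # fault has expired
            return "ok"
        # INTERMITTENT: fail with the configured probability
        return None if self._rng.random() < fault.probability else "ok"


def host_state(host: SimulatedHost, missed_ping_limit: int = 3) -> str:
    """Toy model of the core's ping check (assumed rule, not CloudStack's)."""
    for _ in range(missed_ping_limit):
        if host.handle("ping") is not None:
            return "Up"
    return "Disconnected"
```

A resiliency test in this style would inject a permanent "ping" fault, assert that host_state
reports "Disconnected", and then check whatever recovery behaviour (such as HA) is expected.
Keying faults by command type is one possible way to abstract the failure points; other
groupings may turn out to fit the simulator better.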

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
