cloudstack-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marcus Sorensen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CLOUDSTACK-5429) KVM - Primary store down/Network Failure - Hosts attempt to reboot becasue of primary store being down hangs.
Date Tue, 18 Nov 2014 00:14:35 GMT

    [ https://issues.apache.org/jira/browse/CLOUDSTACK-5429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215462#comment-14215462
] 

Marcus Sorensen commented on CLOUDSTACK-5429:
---------------------------------------------

No, that does not work. VMs cannot be cleanly shut down (or even forced off) if their storage
is hanging. The qemu processes will be in D state and unresponsive. Force reboot of host via
IPMI or sysrq trigger, or something like that would be necessary, and the mgmt server would
need to recognize that this has happened so the VMs can start elsewhere safely.

> KVM - Primary store down/Network Failure - Hosts attempt to reboot becasue of primary
store being down hangs.
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: CLOUDSTACK-5429
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-5429
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the default.) 
>          Components: Management Server
>    Affects Versions: 4.3.0
>         Environment: Build from 4.3
>            Reporter: Sangeetha Hariharan
>            Assignee: edison su
>            Priority: Critical
>             Fix For: 4.4.0
>
>         Attachments: kvm-networkshutdown.png, kvmhostreboot.png, psdown.rar
>
>
> KVM - Primary store down - Hosts attempt to reboot becasue of primary store being down
hangs.
> Set up:
> Advanced zone with KVM (RHEL 6.3) hosts.
> Steps to reproduce the problem:
> 1. Deploy few Vms in each of the hosts with 10 GB ROOT volume size , so we start with
10 Vms.
> 2. Create snaposhot for ROOT volumes.
> 3. When snapshot is still in progress , Make the primary storage unavailable for 10 mts.
> This results in the KVM hosts to reboot.
> But reboot of KVM host is not successful.
> It is stuck at trying to unmount nfs mount points.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message