Return-Path:
X-Original-To: apmail-incubator-cloudstack-dev-archive@minotaur.apache.org
Delivered-To: apmail-incubator-cloudstack-dev-archive@minotaur.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3])
    by minotaur.apache.org (Postfix) with SMTP id 3B161E2B3
    for ; Fri, 18 Jan 2013 11:47:51 +0000 (UTC)
Received: (qmail 8738 invoked by uid 500); 18 Jan 2013 11:47:50 -0000
Delivered-To: apmail-incubator-cloudstack-dev-archive@incubator.apache.org
Received: (qmail 8701 invoked by uid 500); 18 Jan 2013 11:47:50 -0000
Mailing-List: contact cloudstack-dev-help@incubator.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: cloudstack-dev@incubator.apache.org
Delivered-To: mailing list cloudstack-dev@incubator.apache.org
Received: (qmail 8669 invoked by uid 99); 18 Jan 2013 11:47:50 -0000
Received: from reviews-vm.apache.org (HELO reviews.apache.org) (140.211.11.40)
    by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jan 2013 11:47:50 +0000
Received: from reviews.apache.org (localhost [127.0.0.1])
    by reviews.apache.org (Postfix) with ESMTP id B4C8D1C541D;
    Fri, 18 Jan 2013 11:47:43 +0000 (UTC)
Content-Type: multipart/alternative; boundary="===============8420209246895480206=="
MIME-Version: 1.0
Subject: Re: Review Request: CLOUDSTACK-803: HA gets triggered even when the host investigator is unable to determine the state of the host
From: "Abhinandan Prateek"
To: "Abhinandan Prateek", "Alex Huang"
Cc: "cloudstack", "Koushik Das"
Date: Fri, 18 Jan 2013 11:47:43 -0000
Message-ID: <20130118114743.27590.82689@reviews.apache.org>
X-ReviewBoard-URL: https://reviews.apache.org
Auto-Submitted: auto-generated
Sender: "Abhinandan Prateek"
X-ReviewGroup: cloudstack
X-ReviewRequest-URL: https://reviews.apache.org/r/8853/
X-Sender: "Abhinandan Prateek"
References: <20130107082606.2471.27958@reviews.apache.org>
In-Reply-To: <20130107082606.2471.27958@reviews.apache.org>
Reply-To: "Abhinandan Prateek"
--===============8420209246895480206==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/8853/#review15486
-----------------------------------------------------------

Ship it!


Ship It!

- Abhinandan Prateek


On Jan. 7, 2013, 8:26 a.m., Koushik Das wrote:
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/8853/
> -----------------------------------------------------------
>
> (Updated Jan. 7, 2013, 8:26 a.m.)
>
>
> Review request for cloudstack, Abhinandan Prateek and Alex Huang.
>
>
> Description
> -------
>
> CS tries to do HA on VMs if it finds that the host on which they are running is down. HA should only happen when CS is able to determine that the host is down (that is, the host investigators return the status 'Down'). Currently, due to a bug in the code, HA gets triggered even if the host investigators are unable to determine the state (they return 'null').
>
> This results in unnecessary HA of VMs.
>
> The fix is to not do HA when the state of the host cannot be determined.
>
>
> This addresses bug CLOUDSTACK-803.
>
>
> Diffs
> -----
>
>   server/src/com/cloud/agent/manager/AgentManagerImpl.java 8141fdb
>
> Diff: https://reviews.apache.org/r/8853/diff/
>
>
> Testing
> -------
>
> I did the following manual testing:
>
> - Set up an advanced zone with an XS host
> - Created a VM
> - Pulled out the network cable for the MS
> - Checked in the debugger that the DisconnectTask is running
> - This calls handleDisconnectWithInvestigation in AgentManagerImpl
> - In the debugger, saw that the following line of code returns null as the state: 'final Status determinedState = investigate(attache);'
> - So the newly added warning message is logged: 's_logger.warn("Agent state cannot be determined, do nothing");'
> - The above steps repeat in a loop until I plug the network cable back in, after which the investigator again determines the status of the host correctly.
>
> If either the XS host or the MS is disconnected from the network, the investigator cannot determine the state of the host, so with the fix HA won’t be triggered.
>
> Sowmya from the Citrix QA team also did additional testing for this on a private branch.
>
>
> Thanks,
>
> Koushik Das
>
>

--===============8420209246895480206==--
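
The rule described in the quoted review request can be sketched as follows. This is a minimal, self-contained Java illustration and not the actual AgentManagerImpl code: the Status and Action enums, the Investigator interface, and the HaDecisionSketch class are hypothetical stand-ins introduced here. Only the rule itself comes from the description: a null result from the investigators must not trigger HA, and only a determined 'Down' state does.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for the host status; not the CloudStack type.
enum Status { Up, Down, Disconnected }

// Hypothetical outcome of handling a disconnect.
enum Action { TRIGGER_HA, DO_NOTHING }

// Hypothetical investigator: returns null when it cannot determine the state.
interface Investigator {
    Status investigate(long hostId);
}

final class HaDecisionSketch {

    // Ask each investigator in turn; the first non-null answer wins.
    // Returns null if none of them can determine the host state.
    static Status investigate(List<Investigator> investigators, long hostId) {
        for (Investigator investigator : investigators) {
            Status state = investigator.investigate(hostId);
            if (state != null) {
                return state;
            }
        }
        return null;
    }

    // Core of the fix: an undetermined (null) state must not trigger HA.
    static Action decide(Status determinedState) {
        if (determinedState == null) {
            // Mirrors the warning quoted in the Testing section.
            System.out.println("WARN: Agent state cannot be determined, do nothing");
            return Action.DO_NOTHING;
        }
        if (determinedState == Status.Down) {
            return Action.TRIGGER_HA;
        }
        // Any other determined state is not a confirmed outage.
        return Action.DO_NOTHING;
    }

    public static void main(String[] args) {
        // Management server cut off from the network: nobody can answer.
        List<Investigator> unreachable =
                Arrays.<Investigator>asList(id -> null, id -> null);
        // Second investigator confirms that the host is down.
        List<Investigator> confirmedDown =
                Arrays.<Investigator>asList(id -> null, id -> Status.Down);

        System.out.println(decide(investigate(unreachable, 1L)));
        System.out.println(decide(investigate(confirmedDown, 1L)));
    }
}
```

Keeping the null check separate from the 'Down' check makes the "unknown" case explicit, which is the case the manual test above exercises by unplugging the network cable.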