MIME-Version: 1.0
References: <2930912a-406e-4136-a91a-0a933ccbbbce@default>
In-Reply-To: <2930912a-406e-4136-a91a-0a933ccbbbce@default>
From: Clebert Suconic
Date: Sat, 14 Jul 2018 08:59:01 -0400
Subject: Re: Artemis Failover tests
To: users@activemq.apache.org
Reply-To: users@activemq.apache.org
Content-Type:
 multipart/alternative; boundary="000000000000b6c6d00570f52940"

--000000000000b6c6d00570f52940
Content-Type: text/plain; charset="UTF-8"

Can you try with 2.6.2? There were a few fixes in voting.

On Fri, Jul 13, 2018 at 8:31 PM Neha Sareen wrote:

> Hi,
>
> - We are setting up a cluster of 6 brokers using Artemis 2.4.0.
> - The cluster has 3 groups.
> - Each group has one master and one slave broker.
> - The HA uses replication.
> - Each master broker configuration has the flag 'check-for-live-server' set to true.
> - Each slave broker configuration has the flag 'allow-failback' set to true.
> - We use static connectors to allow cluster topology discovery.
> - Each broker's static connector list includes the connectors to the other 5 servers in the cluster.
> - Each broker declares its acceptor.
> - Each broker exports its own connector information via the 'connector-ref' configuration element.
> - The acceptor and the connector URLs for each broker are identical with respect to the host and port information.
>
> We have a standalone test application that creates producers and consumers to write and receive messages respectively.
>
> We are trying to execute an automatic failover test case with the following characteristics:
>
> - Initially, create a separate connection and session for both producer and consumer to a master broker.
> - Now send and consume a handful of messages using this initial connection.
> - Now gracefully shut down the master broker.
> - The test continues trying to produce some more messages, and the consumer then tries to consume those messages.
>
> However, I see the following exception after killing the master broker:
>
> javax.jms.JMSException: AMQ119014: Timed out after waiting 30,000 ms for response when sending packet 71
>
> The URL being used for our tests is as follows:
>
> "tcp://localhost:" + masterBrokerPort + "?ha=true&retryInterval=1000&retryIntervalMultiplier=1.0&reconnectAttempts=-1&clientFailureCheckPeriod=20000"
>
> Also, this is what I see in the slave broker logs (slave broker running on port 61476):
>
> 17:07:16,889 INFO [org.apache.activemq.artemis.core.server] AMQ221109: Apache ActiveMQ Artemis Backup Server version 2.4.0 [null] started, waiting live to fail before it gets active
> 17:07:25,366 INFO [org.apache.activemq.artemis.core.server] AMQ221024: Backup server ActiveMQServerImpl::serverUUID=fa7192a0-816f-11e8-a66a-08002737e2ae is synchronized with live-server.
> 17:07:25,449 INFO [org.apache.activemq.artemis.core.server] AMQ221031: backup announced
> 17:10:10,237 INFO [org.apache.activemq.artemis.core.server] AMQ221066: Initiating quorum vote: LiveFailoverQuorumVote
> 17:10:10,238 INFO [org.apache.activemq.artemis.core.server] AMQ221067: Waiting 30 seconds for quorum vote results.
> 17:10:10,322 WARN [org.apache.activemq.artemis.core.client] AMQ212037: Connection failure has been detected: AMQ119015: The connection was disconnected because of server shutdown [code=DISCONNECTED]
> 17:10:10,331 INFO [org.apache.activemq.artemis.core.server] AMQ221060: Sending quorum vote request to localhost/127.0.0.1:61456: ServerConnectVote [nodeId=fa7192a0-816f-11e8-a66a-08002737e2ae, vote=false]
> 17:10:10,333 INFO [org.apache.activemq.artemis.core.server] AMQ221061: Received quorum vote response from localhost/127.0.0.1:61456: ServerConnectVote [nodeId=fa7192a0-816f-11e8-a66a-08002737e2ae, vote=true]
> 17:10:10,351 INFO [org.apache.activemq.artemis.core.server] AMQ221060: Sending quorum vote request to localhost/127.0.0.1:61466: ServerConnectVote [nodeId=fa7192a0-816f-11e8-a66a-08002737e2ae, vote=false]
> 17:10:10,353 INFO [org.apache.activemq.artemis.core.server] AMQ221061: Received quorum vote response from localhost/127.0.0.1:61466: ServerConnectVote [nodeId=fa7192a0-816f-11e8-a66a-08002737e2ae, vote=true]
> 17:10:10,354 INFO [org.apache.activemq.artemis.core.server] AMQ221068: Received all quorum votes.
> 17:10:10,438 INFO [org.apache.activemq.artemis.core.server] AMQ221071: Failing over based on quorum vote results.
> 17:10:10,499 WARN [org.apache.activemq.artemis.core.client] AMQ212037: Connection failure has been detected: AMQ119015: The connection was disconnected because of server shutdown [code=DISCONNECTED]
> 17:10:10,765 INFO [org.apache.activemq.artemis.core.server] AMQ221037: ActiveMQServerImpl::serverUUID=fa7192a0-816f-11e8-a66a-08002737e2ae to become 'live'
> 17:10:10,798 WARN [org.apache.activemq.artemis.core.client] AMQ212004: Failed to connect to server.
> 17:10:11,380 INFO [org.apache.activemq.artemis.core.server] AMQ221003: Deploying queue DLQ on address DLQ
> 17:10:11,381 INFO [org.apache.activemq.artemis.core.server] AMQ221003: Deploying queue ExpiryQueue on address ExpiryQueue
> 17:10:11,381 INFO [org.apache.activemq.artemis.core.server] AMQ221003: Deploying queue exampleQueue on address exampleQueue
> 17:10:11,405 INFO [org.apache.activemq.artemis.core.server] AMQ221007: Server is now live
> 17:10:11,437 INFO [org.apache.activemq.artemis.core.server] AMQ221020: Started EPOLL Acceptor at 0.0.0.0:61476 for protocols [CORE,MQTT,AMQP,STOMP,HORNETQ,OPENWIRE]
>
> Can someone let us know what the issue is here and how we can rectify this?
>
> Thanks
> Neha

--
Clebert Suconic

--000000000000b6c6d00570f52940--
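
The client URL in the quoted message appears as a Java string concatenation with its quoting partly lost. As a rough illustration only, the sketch below shows one way that URL might be assembled. The class name FailoverUrl and the port 61616 are assumptions (the message never states the master broker's port); the parameter names and values are copied from the quoted URL.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical helper, not part of the original message: assembles the
// client URL quoted in the thread from a broker port.
final class FailoverUrl {

    static String build(int masterBrokerPort) {
        // Parameters copied from the quoted URL. A LinkedHashMap keeps
        // insertion order, so the query string matches the quoted one.
        Map<String, String> params = new LinkedHashMap<>();
        params.put("ha", "true");
        params.put("retryInterval", "1000");
        params.put("retryIntervalMultiplier", "1.0");
        params.put("reconnectAttempts", "-1");
        params.put("clientFailureCheckPeriod", "20000");

        String query = params.entrySet().stream()
                .map(e -> e.getKey() + "=" + e.getValue())
                .collect(Collectors.joining("&"));
        return "tcp://localhost:" + masterBrokerPort + "?" + query;
    }

    public static void main(String[] args) {
        // 61616 is an assumed port; the message does not give the master's port.
        System.out.println(build(61616));
    }
}
```

The resulting string would presumably be handed to the Artemis JMS client's connection factory in the test application; that part depends on the client library and is left out of this sketch.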