From: Scott Van Wart
Date: Tue, 17 Jul 2018 14:23:20 -0300
Subject: Retry queue message delivery to another node after timeout or connection failure
To: users@activemq.apache.org

I can't for the life of me figure out how to get ActiveMQ Artemis to try another node for message delivery.

I have a domain-managed WildFly cluster with 3 nodes: WildFly 12 (Artemis 1.5.5) and JDK 1.8.0_131 running on Ubuntu 16.04. I started with the defaults. I deployed an EAR with a single MDB that listens to a durable queue. I then connect to a node and send a test message every 250ms. I can see the messages appearing round-robin on all nodes (and JMSDeliveryMode is PERSISTENT). The MDB is configured with dups-ok-acknowledge.

I changed some settings from the defaults that ship with WildFly for cluster-connection:

  check-period=500
  connection-ttl=1000
  reconnect-attempts=2
  call-timeout=1000
  call-failover-timeout=500
  use-duplicate-detection=false

Other relevant settings:

  retry-interval=500
  retry-interval-multiplier=1.5
  initial-connect-attempts=-1
  message-load-balancing=ON_DEMAND
  notification-interval=1000
  notification-attempts=2

While the test is going on, I unplug the network cable to one of the nodes. The other two nodes fail their 3rd node connection in about a second and start distributing the messages across only the 2 remaining nodes, which is fine. But I "lose" about 2-3 messages during this time to the failed node.

I can leave that failed node unplugged for as long as I want. I can even plug the failed node back in and it won't retransmit these 2-3 messages. Finally, I restart all the nodes and the 2-3 "lost" messages are then transmitted (much later) and only to the failed node.

What I really want is for ActiveMQ to quickly retry delivery to another node. So if it attempts delivery and the message isn't acknowledged for 750-1000ms, try another node. I can handle duplicates just fine.

Am I going about this the right way?

Thanks,
Scott
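
As a rough sanity check on the numbers above, the following Python sketch estimates how many messages could be routed to the unplugged node before the cluster connection is dropped. It is back-of-the-envelope arithmetic, not a model of Artemis internals. It assumes that the failure window is at most connection-ttl plus the reconnect backoff (retry-interval grown by retry-interval-multiplier for each of the reconnect-attempts), and that messages are spread evenly round-robin during that window. The function names are hypothetical.

```python
def retry_delays(retry_interval_ms, multiplier, attempts):
    """Delay before each reconnect attempt, growing geometrically."""
    return [retry_interval_ms * multiplier ** i for i in range(attempts)]


def estimate_stranded(window_ms, send_interval_ms, nodes):
    """Return (messages sent during the window, round-robin share per node)."""
    sent = window_ms / send_interval_ms
    return sent, sent / nodes


# Values taken from the settings listed in the message.
delays = retry_delays(retry_interval_ms=500, multiplier=1.5, attempts=2)
detect_ms = 1000  # connection-ttl

# ASSUMPTION: worst case, the node stays a routing target until the TTL
# expires and both reconnect attempts have been exhausted.
worst_window_ms = detect_ms + sum(delays)

best = estimate_stranded(detect_ms, send_interval_ms=250, nodes=3)
worst = estimate_stranded(worst_window_ms, send_interval_ms=250, nodes=3)

print("reconnect delays (ms):", delays)
print("messages to failed node, TTL only:", round(best[1], 2))
print("messages to failed node, TTL + retries:", round(worst[1], 2))
```

Under these assumptions the estimate falls between about 1 and 3 messages, which is in the same range as the 2-3 messages observed stuck on the failed node.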