Return-Path: Delivered-To: apmail-activemq-dev-archive@www.apache.org Received: (qmail 21187 invoked from network); 26 Jun 2009 16:20:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Jun 2009 16:20:51 -0000 Received: (qmail 90539 invoked by uid 500); 26 Jun 2009 16:21:02 -0000 Delivered-To: apmail-activemq-dev-archive@activemq.apache.org Received: (qmail 90500 invoked by uid 500); 26 Jun 2009 16:21:02 -0000 Mailing-List: contact dev-help@activemq.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@activemq.apache.org Delivered-To: mailing list dev@activemq.apache.org Received: (qmail 90452 invoked by uid 99); 26 Jun 2009 16:21:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Jun 2009 16:21:01 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Jun 2009 16:20:59 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 622E7234C004 for ; Fri, 26 Jun 2009 09:20:39 -0700 (PDT) Message-ID: <2112059634.1246033239385.JavaMail.jira@brutus> Date: Fri, 26 Jun 2009 09:20:39 -0700 (PDT) From: "Peter Jones (JIRA)" To: dev@activemq.apache.org Subject: [jira] Created: (AMQ-2308) org.apache.activemq.broker.ft.TransactedTopicMasterSlaveTest test hang on Solaris MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: ae95407df07c98740808b2ef9da0087c X-Virus-Checked: Checked by ClamAV on apache.org org.apache.activemq.broker.ft.TransactedTopicMasterSlaveTest test hang on Solaris --------------------------------------------------------------------------------- Key: AMQ-2308 URL: https://issues.apache.org/activemq/browse/AMQ-2308 Project: ActiveMQ Issue Type: Bug Components: Transport Environment: Solaris - jdk15 32 bit - also jdk16 64 bit Reporter: Peter Jones Seeing the TransactedTopicMasterSlaveTest hang when run on Solaris. After examining jstack output for the hanging test and discussing with Gary Tully, he had the following suggestion: {quote} Hi Peter, this looks like an example of http://issues.apache.org/activemq/browse/AMQ-1993 but this time on the client side. It seems like a write that results in the first reconnect attempt does not terminate but it should be aborting as there is no listening or reading thread. I guess there is no back log because the next write is blocked by the transport reconnect mutex. Some combination of low level tcp retries should be able to be configured at the OS level, and I guess the defaults should be less than 1 hour so I would expect this test to eventually complete, however, the OS level timeouts and retries may be contingent on a backlog reaching a minimum and currently there will be only one outstanding write due the the locking around the reconnect logic. This being the case, some code needs to implement the timeout. The solution from http://issues.apache.org/activemq/browse/AMQ-1993 may be a good approach here. It will timeout a write call. However the solution is currently only applicable to server side sockets, it needs to be extended to support a client connection. I think all that is needed is to move the additional configuration code from: org.apache.activemq.transport.TransportFactory.serverConfigure(Transport, WireFormat, HashMap) to compositeConfigure that is called for all transports (both client and server) {quote} Indeed, this fixes the problem. I've attached a patch against trunk which resolves the issue. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.