Return-Path: X-Original-To: apmail-db-derby-dev-archive@www.apache.org Delivered-To: apmail-db-derby-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 301749A4F for ; Fri, 9 Mar 2012 12:31:19 +0000 (UTC) Received: (qmail 84630 invoked by uid 500); 9 Mar 2012 12:31:19 -0000 Delivered-To: apmail-db-derby-dev-archive@db.apache.org Received: (qmail 84599 invoked by uid 500); 9 Mar 2012 12:31:19 -0000 Mailing-List: contact derby-dev-help@db.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: Delivered-To: mailing list derby-dev@db.apache.org Received: (qmail 84592 invoked by uid 99); 9 Mar 2012 12:31:18 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2012 12:31:18 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2012 12:31:17 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id CF03C1651C for ; Fri, 9 Mar 2012 12:30:57 +0000 (UTC) Date: Fri, 9 Mar 2012 12:30:57 +0000 (UTC) From: "Knut Anders Hatlen (Commented) (JIRA)" To: derby-dev@db.apache.org Message-ID: <548668011.43219.1331296257849.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1787141676.42724.1331282336181.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (DERBY-5643) Occasional hangs in replication tests on Linux MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/DERBY-5643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226041#comment-13226041 ] Knut Anders Hatlen commented on DERBY-5643: ------------------------------------------- I ran ReplicationSuite in a loop on one of the machines where this was seen, and after a couple of iterations it was stuck. I ran the suite with derby.tests.repltrace=true, and the trace indicated that the slave server used more than 3 minutes to get up and accept connections. ReplicationRun.startServer() pings the server for 75 seconds before it gives up, so it gave up before the server was up. The attempt to shut down the slave server also failed, because a connection to the server could not be established. tearDown() ended up waiting for the server to stop, and the server of course didn't stop since it never received the shutdown command. So the question is: Why does it take 3 minutes for the server to start accepting connections? > Occasional hangs in replication tests on Linux > ---------------------------------------------- > > Key: DERBY-5643 > URL: https://issues.apache.org/jira/browse/DERBY-5643 > Project: Derby > Issue Type: Bug > Components: Replication, Test > Affects Versions: 10.9.0.0 > Reporter: Knut Anders Hatlen > Attachments: thread-dump.txt > > > We occasionally see hangs in the replication tests on Linux. For example here: http://dbtg.foundry.sun.com/derby/test/Daily/jvm1.6/testing/testlog/sles/1298470-suitesAll_diff.txt > This test run was stuck in tearDown() after ReplicationRun_Local_Derby4910.testSlaveWaitsForMaster(). (Waiting for Thread.join() to return.) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira