Return-Path: X-Original-To: apmail-tomcat-users-archive@www.apache.org Delivered-To: apmail-tomcat-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 62D0410C88 for ; Tue, 2 Jul 2013 07:49:05 +0000 (UTC) Received: (qmail 28847 invoked by uid 500); 2 Jul 2013 07:49:02 -0000 Delivered-To: apmail-tomcat-users-archive@tomcat.apache.org Received: (qmail 28663 invoked by uid 500); 2 Jul 2013 07:49:01 -0000 Mailing-List: contact users-help@tomcat.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "Tomcat Users List" Delivered-To: mailing list users@tomcat.apache.org Received: (qmail 28653 invoked by uid 99); 2 Jul 2013 07:49:00 -0000 Received: from minotaur.apache.org (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Jul 2013 07:49:00 +0000 Received: from localhost (HELO mail-ie0-f180.google.com) (127.0.0.1) (smtp-auth username kfujino, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Jul 2013 07:49:00 +0000 Received: by mail-ie0-f180.google.com with SMTP id f4so11169037iea.39 for ; Tue, 02 Jul 2013 00:48:59 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=qcXtzyjaEmhVdJSykv+3zFoHea3FVScYuvClqQKtsJ4=; b=bxrVXidkH4NRzgwPYwesPIE4hnI4QJAz1yiDYGqt/YKfbn2YQuAFPx85juEVHS3UD4 IxWXjVJ/KuiIcU7pEfv+KBMKFQ56DDHcJSYAodjLEzY0LKDT0Trdw9k4/8ByQ02BUXjU Ql2WoPLQVqLFXQFqr9bzj7mhnv4y67rizR/scAcA6XWp9iLB4dReEZ4J/Hm3TV2AByfk /DDZLDAQ9Nc7TUIb5mY/uoSlscEJyBYHyUDbPwCmz8az3xd8JohO7IlqbVGkJq63RG7+ 7kxGUwgVg/bIp7PmhWi8yIDGIgU7IsqEWgXhsFgNB5XUFFxKT479Ws1z0s67P0+8zPrq XLNw== MIME-Version: 1.0 X-Received: by 10.50.127.139 with SMTP id ng11mr8864612igb.6.1372751339462; Tue, 02 Jul 2013 00:48:59 -0700 (PDT) Received: by 10.64.19.228 with HTTP; Tue, 2 Jul 2013 00:48:59 -0700 (PDT) In-Reply-To: <51cc7e06.4267320a.7961.ffffba4f@mx.google.com> References: <51cc7e06.4267320a.7961.ffffba4f@mx.google.com> Date: Tue, 2 Jul 2013 16:48:59 +0900 Message-ID: Subject: Re: BackupManager start fails under heavy load From: Keiichi Fujino To: Tomcat Users List Content-Type: multipart/alternative; boundary=089e0139ff4af431d604e0829371 --089e0139ff4af431d604e0829371 Content-Type: text/plain; charset=ISO-8859-1 2013/6/28 Patrick Savage > We have an issue in our Tomcat 7.0.30 clustered production environment on > RHEL 5 where Tomcat fails to start our application when other nodes in the > cluster are under extremely heavy load. It fails because the BackupManager > cannot start the replicated map due to timeouts trying to connect to all > the > other nodes. The only way to recover from this seems to be shutting down > almost all of the nodes and then starting them again. The cluster has 9 > nodes, but we have also had the problem with 6 nodes. > > > > Is there a way to ensure the application will start even if the > BackupManager cannot connect to the other nodes? No. If replication map fails to start, associated context will fail to start. I will implement a feature to ensure the application will start even If replication map fails to start. -- Keiichi.Fujino --089e0139ff4af431d604e0829371--