From: Raúl García
To: Tomcat Users List
Subject: RE: Tomcat 6 - Cluster error.
Date: Fri, 25 Jan 2008 11:42:03 +0100

Hi again,

Once again, thanks for your time, but we still have problems.

We applied the "keepAliveCount=0" parameter, and last Wednesday, 23 Jan, we restarted both nodes. Around 11 hours after startup, node 1 reported a new error, although both nodes are working perfectly. I cannot imagine why the member disappears unexpectedly. I am reposting the error and the config files.

INSTANCE 1 - LOG
================
Jan 24, 2008 10:25:54 PM org.apache.catalina.tribes.group.interceptors.TcpFailureDetector memberDisappeared
INFO: Received memberDisappeared[org.apache.catalina.tribes.membership.MemberImpl[tcp://localhost:4002,localhost,4002, alive=123412856,id={-31 -91 -122 -60 -58 -5 68 25 -87 13 -20 -12 -100 5 -16 94 }, payload={}, command={}, domain={}, ]] message. Will verify.
Jan 24, 2008 10:25:54 PM org.apache.catalina.tribes.group.interceptors.TcpFailureDetector memberDisappeared
INFO: Verification complete. Member still alive[org.apache.catalina.tribes.membership.MemberImpl[tcp://localhost:4002,localhost,4002, alive=123412856,id={-31 -91 -122 -60 -58 -5 68 25 -87 13 -20 -12 -100 5 -16 94 }, payload={}, command={}, domain={}, ]]
Jan 24, 2008 10:25:54 PM org.apache.catalina.ha.tcp.SimpleTcpCluster send
SEVERE: Unable to send message through cluster sender.
org.apache.catalina.tribes.ChannelException: Operation has timed out(60000 ms.).; Faulty members:tcp://localhost:4002;
    at org.apache.catalina.tribes.transport.nio.ParallelNioSender.sendMessage(ParallelNioSender.java:97)
    at org.apache.catalina.tribes.transport.nio.PooledParallelSender.sendMessage(PooledParallelSender.java:53)
    at org.apache.catalina.tribes.transport.ReplicationTransmitter.sendMessage(ReplicationTransmitter.java:80)
    at org.apache.catalina.tribes.group.ChannelCoordinator.sendMessage(ChannelCoordinator.java:78)
    at org.apache.catalina.tribes.group.ChannelInterceptorBase.sendMessage(ChannelInterceptorBase.java:75)
    at org.apache.catalina.tribes.group.interceptors.ThroughputInterceptor.sendMessage(ThroughputInterceptor.java:61)
    at org.apache.catalina.tribes.group.ChannelInterceptorBase.sendMessage(ChannelInterceptorBase.java:75)
    at org.apache.catalina.tribes.group.interceptors.MessageDispatchInterceptor.sendMessage(MessageDispatchInterceptor.java:73)
    at org.apache.catalina.tribes.group.ChannelInterceptorBase.sendMessage(ChannelInterceptorBase.java:75)
    at org.apache.catalina.tribes.group.interceptors.TcpFailureDetector.sendMessage(TcpFailureDetector.java:87)
    at org.apache.catalina.tribes.group.ChannelInterceptorBase.sendMessage(ChannelInterceptorBase.java:75)
    at org.apache.catalina.tribes.group.GroupChannel.send(GroupChannel.java:216)
    at org.apache.catalina.tribes.group.GroupChannel.send(GroupChannel.java:175)
    at org.apache.catalina.ha.tcp.SimpleTcpCluster.send(SimpleTcpCluster.java:835)
    at org.apache.catalina.ha.tcp.SimpleTcpCluster.sendClusterDomain(SimpleTcpCluster.java:814)
    at org.apache.catalina.ha.tcp.ReplicationValve.send(ReplicationValve.java:551)
    at org.apache.catalina.ha.tcp.ReplicationValve.sendMessage(ReplicationValve.java:535)
    at org.apache.catalina.ha.tcp.ReplicationValve.sendSessionReplicationMessage(ReplicationValve.java:517)
    at org.apache.catalina.ha.tcp.ReplicationValve.sendReplicationMessage(ReplicationValve.java:428)
    at org.apache.catalina.ha.tcp.ReplicationValve.invoke(ReplicationValve.java:362)
    at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:263)
    at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:844)
    at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:584)
    at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447)
    at java.lang.Thread.run(Thread.java:619)
Jan 24, 2008 10:26:54 PM org.apache.catalina.tribes.group.interceptors.TcpFailureDetector memberDisappeared
INFO: Received memberDisappeared
[...] (the same sequence repeats only once more)

Jan 25, 2008 5:37:52 AM org.apache.catalina.tribes.group.interceptors.ThroughputInterceptor report
INFO: ThroughputInterceptor Report[
    Tx Msg:66167 messages
    Sent:37.02 MB (total)
    Sent:37.02 MB (application)
    Time:118.53 seconds
    Tx Speed:0.31 MB/sec (total)
    TxSpeed:0.31 MB/sec (application)
    Error Msg:2
    Rx Msg:90000 messages
    Rx Speed:0.00 MB/sec (since 1st msg)
    Received:41.06 MB]

INSTANCE-1 --- Server.xml
==========================
NOTE: 111.111.111.111 is the server IP address.
==========================

==============================================

INSTANCE-2 server.xml
=====================

===============================

-----Original Message-----
From: Filip Hanik - Dev Lists [mailto:devlists@hanik.com]
Sent: Thursday, 17 January 2008 19:01
To: Tomcat Users List
Subject: Re: Tomcat 6 - Cluster error.

already replied to your old thread

ok, it looks like you might have ended up with a rogue socket, and what happens is that any message sent to that socket just gets lost in the ether, since it doesn't have any interest ops.
There is a workaround for this: turn off keep alives altogether, or implement a keep alive timeout.

Option 1 - no keep alives at all

Option 2 - implement a keep alive timeout

or make a combination of both values

either option should work for you.
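
For anyone following the thread: the configuration snippets that accompanied the two options did not survive in this copy of the message, so the fragment below is only a sketch of how they might look in server.xml, not the exact text that was sent. It assumes the sender is the PooledParallelSender seen in the stack trace and uses the keepAliveCount and keepAliveTime attributes of the Transport element; the 60000 ms value is a hypothetical placeholder, and the rest of the Cluster and Channel configuration is omitted.

```xml
<!-- Sketch only: goes inside <Cluster> ... <Channel> in server.xml. -->
<Sender className="org.apache.catalina.tribes.transport.ReplicationTransmitter">
  <!-- Option 1: keepAliveCount="0" closes the socket after each request,
       so no connection is kept alive between messages. -->
  <!-- Option 2: keepAliveTime (milliseconds) closes a connection once it has
       been open that long. 60000 is an assumed value, not taken from the thread. -->
  <Transport className="org.apache.catalina.tribes.transport.nio.PooledParallelSender"
             keepAliveCount="0"
             keepAliveTime="60000"/>
</Sender>
```

Either attribute can be set on its own. Both default to -1 (unlimited), which is the setting under which a stale socket could be reused indefinitely.
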
On a side note, I'm interested in whether the scenario you ran into is reproducible. If it keeps happening over and over again, then, if possible, I'd like to get some debug logs from you.

Filip

---------------------------------------------------------------------
To start a new topic, e-mail: users@tomcat.apache.org
To unsubscribe, e-mail: users-unsubscribe@tomcat.apache.org
For additional commands, e-mail: users-help@tomcat.apache.org