Return-Path: Delivered-To: apmail-hadoop-zookeeper-user-archive@minotaur.apache.org Received: (qmail 30890 invoked from network); 5 Oct 2010 16:47:57 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 5 Oct 2010 16:47:57 -0000 Received: (qmail 64101 invoked by uid 500); 5 Oct 2010 16:47:57 -0000 Delivered-To: apmail-hadoop-zookeeper-user-archive@hadoop.apache.org Received: (qmail 64059 invoked by uid 500); 5 Oct 2010 16:47:57 -0000 Mailing-List: contact zookeeper-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: zookeeper-user@hadoop.apache.org Delivered-To: mailing list zookeeper-user@hadoop.apache.org Received: (qmail 64051 invoked by uid 99); 5 Oct 2010 16:47:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Oct 2010 16:47:57 +0000 X-ASF-Spam-Status: No, hits=-1997.8 required=10.0 tests=ALL_TRUSTED,HTML_MESSAGE,NORMAL_HTTP_TO_IP,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.9] (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 05 Oct 2010 16:47:54 +0000 Received: (qmail 30776 invoked by uid 99); 5 Oct 2010 16:47:32 -0000 Received: from localhost.apache.org (HELO mail-ww0-f48.google.com) (127.0.0.1) (smtp-auth username phunt, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Oct 2010 16:47:32 +0000 Received: by wwb22 with SMTP id 22so8688624wwb.29 for ; Tue, 05 Oct 2010 09:47:30 -0700 (PDT) MIME-Version: 1.0 Received: by 10.227.137.81 with SMTP id v17mr10185295wbt.10.1286297250185; Tue, 05 Oct 2010 09:47:30 -0700 (PDT) Received: by 10.227.157.82 with HTTP; Tue, 5 Oct 2010 09:47:30 -0700 (PDT) In-Reply-To: References: Date: Tue, 5 Oct 2010 09:47:30 -0700 Message-ID: Subject: Re: Too many connections From: Patrick Hunt To: zookeeper-user , avinash.lakshman@gmail.com Content-Type: multipart/alternative; boundary=001636416ae3ac43d60491e16aea X-Virus-Checked: Checked by ClamAV on apache.org --001636416ae3ac43d60491e16aea Content-Type: text/plain; charset=ISO-8859-1 "A" (/10.30.84.207 a zookeeper client) is attempting to establish more then 10 sessions to the ZooKeeper server where you got the log. This can be caused by a bug in user code (we've seen bugs where incorrectly implemented ZK clients attempt to create an infinite number of sessions, which essentially DOS the service, so we added the maxClientCnxn default limit of 10). Often users see this problem when they are trying to simulate a real environment - they run a simulated set of clients sessions (>10) from a single host (ip) hitting the servers. However in your case I'm guessing that it has something to do with this "The observers are running in a different data center from where the ZK non-observers are running." Could you have a NAT or some other networking configuration that makes all the observers seem to be coming from the same IP address? Patrick On Tue, Oct 5, 2010 at 9:33 AM, Avinash Lakshman wrote: > Thanks Patrick. But what does this mean? I see the log on server A telling > me "Too many connections from A - default is 10". Too many connection from > A > to whom? I do not see who the other end of the connection is. > > Cheers > Avinash > > On Tue, Oct 5, 2010 at 9:27 AM, Patrick Hunt wrote: > > > See this configuration param in the docs "maxClientCnxns": > > > > > http://hadoop.apache.org/zookeeper/docs/current/zookeeperAdmin.html#sc_advancedConfiguration > > > > < > > > http://hadoop.apache.org/zookeeper/docs/current/zookeeperAdmin.html#sc_advancedConfiguration > > > > > Patrick > > > > On Tue, Oct 5, 2010 at 8:10 AM, Avinash Lakshman < > > avinash.lakshman@gmail.com > > > wrote: > > > > > I find this happening in my observers node in the logs. The observers > are > > > running in a different data center from where the ZK non-observers are > > > running. The only way to fix this seems to be restarting. How can I > start > > > addressing this? Here is the stack trace. > > > > > > Too many connections from /10.30.84.207 - max is 10 > > > WARN - Session 0x0 for server mybox.mydomain.com/10.30.84.207:5001, > > > unexpected error, closing socket connection and attempting reconnect > > > java.io.IOException: Connection reset by peer > > > at sun.nio.ch.FileDispatcher.read0(Native Method) > > > at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:21) > > > at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:233) > > > at sun.nio.ch.IOUtil.read(IOUtil.java:200) > > > at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:236) > > > at > > > org.apache.zookeeper.ClientCnxn$SendThread.doIO(ClientCnxn.java:817) > > > at > > > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1089) > > > > > > Please advice. > > > > > > Cheers > > > Avinash > > > > > > --001636416ae3ac43d60491e16aea--