Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@minotaur.apache.org Received: (qmail 87678 invoked from network); 8 Apr 2009 01:13:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 8 Apr 2009 01:13:02 -0000 Received: (qmail 74445 invoked by uid 500); 8 Apr 2009 01:13:01 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 74377 invoked by uid 500); 8 Apr 2009 01:13:01 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 74367 invoked by uid 99); 8 Apr 2009 01:13:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Apr 2009 01:13:01 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [76.13.9.55] (HELO web65511.mail.ac4.yahoo.com) (76.13.9.55) by apache.org (qpsmtpd/0.29) with SMTP; Wed, 08 Apr 2009 01:12:52 +0000 Received: (qmail 82088 invoked by uid 60001); 8 Apr 2009 01:12:31 -0000 Message-ID: <134469.81924.qm@web65511.mail.ac4.yahoo.com> X-YMail-OSG: Vyg4fcgVM1khJO827d3r3.c8fQ6rPTKUcBCKtMmu50Vvxvmu0rjl2xJ9M2.abVLYB.QvijZiVorBIZb2nk4spoFSGbxNQ4mB89NNoe8mcjHkDRKO9eL5FgFYDycRr6YFYMlxy9hEUxZ5De8HHv.2.vGs7i2C_KntDUF_XkVo7h4AzFbcZwGtQOc4ZDP_aNpEQc97Dt7ALvhF0IlHJLtu8F4vPBNjEPKDfXtQ2bD6T5m6Mp6zkAhFCaA3_e3usgNX2oVK5oUKigrBDwgWVP2RJ6T4rjm.C_mJAwOnpT.v0vql6SNlyHqV9NyWLw-- Received: from [69.226.16.168] by web65511.mail.ac4.yahoo.com via HTTP; Tue, 07 Apr 2009 18:12:30 PDT X-RocketYMMF: apurtell X-Mailer: YahooMailWebService/0.7.289.1 Date: Tue, 7 Apr 2009 18:12:30 -0700 (PDT) From: Andrew Purtell Reply-To: apurtell@apache.org Subject: Re: ZK rethink? To: hbase-dev@hadoop.apache.org In-Reply-To: <78568af10904071353m4c23acd3p63a69be41b4d4b62@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Checked: Checked by ClamAV on apache.org I see this kind of stuff: 2009-04-07 17:58:13,344 - WARN [NIOServerCxn.Factory:2181:NIOServerCnxn@417] - Exception causing close of session 0x2208296c38e0000 due to java.io.IOException: Read error and bye bye HRS ephemeral znodes, which triggers (currently) HBASE-1314. This I think is ZOOKEEPER-344 https://issues.apache.org/jira/browse/ZOOKEEPER-344 - Andy > From: Ryan Rawson > Subject: Re: ZK rethink? > Date: Tuesday, April 7, 2009, 1:53 PM > > Thanks for the input Joey, and may I be the first to say > "holy shit". > > The reason their approach works is because the C API spins > off OS threads that exist outside the domain of the Java VM, > which means those threads never get paused for GC processing. > > With that kind of input, we might want to consider doing > what he did. Maybe you can donate a bit of code? > > Thanks! > -ryan > > On Tue, Apr 7, 2009 at 1:49 PM, Nitay [...] > > I don't quite understand why the C API with JNI > > fixes the problem. Did that substantially reduce your > > tiny, short lived objects to the point where the > > GC wasn't starving the ZooKeeper IO threads anymore? > > > > Perhaps my initial 10 second value was not enough. > > Andrew, can you try 30 or 60 seconds as a test on your > > cluster to see if that calms things down? [...]