Return-Path: Delivered-To: apmail-hadoop-zookeeper-user-archive@minotaur.apache.org Received: (qmail 77731 invoked from network); 12 May 2010 18:34:23 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 12 May 2010 18:34:23 -0000 Received: (qmail 28971 invoked by uid 500); 12 May 2010 18:34:23 -0000 Delivered-To: apmail-hadoop-zookeeper-user-archive@hadoop.apache.org Received: (qmail 28947 invoked by uid 500); 12 May 2010 18:34:23 -0000 Mailing-List: contact zookeeper-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: zookeeper-user@hadoop.apache.org Delivered-To: mailing list zookeeper-user@hadoop.apache.org Received: (qmail 28939 invoked by uid 99); 12 May 2010 18:34:23 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 May 2010 18:34:23 +0000 X-ASF-Spam-Status: No, hits=-395.7 required=10.0 tests=AWL X-Spam-Check-By: apache.org Received: from [140.211.11.9] (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with SMTP; Wed, 12 May 2010 18:34:22 +0000 Received: (qmail 77277 invoked by uid 99); 12 May 2010 18:34:01 -0000 Received: from localhost.apache.org (HELO [10.0.0.119]) (127.0.0.1) (smtp-auth username phunt, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 May 2010 18:34:01 +0000 Message-ID: <4BEAF49A.9000106@apache.org> Date: Wed, 12 May 2010 11:34:02 -0700 From: Patrick Hunt User-Agent: Mozilla/5.0 (X11; U; Linux i686 (x86_64); en-US; rv:1.9.1.9) Gecko/20100317 Thunderbird/3.0.4 MIME-Version: 1.0 To: Jordan Zimmerman , "zookeeper-user@hadoop.apache.org" Subject: Re: Xid out of order. Got 8 expected 7 References: <412394FA-B66E-443A-9C1F-40E899D46DDA@proofpoint.com> <4BEAEF40.7040106@apache.org> <4BEAF345.9030301@apache.org> <03A4F43F-22A4-4B4A-8244-2C2F7A9102DB@proofpoint.com> In-Reply-To: <03A4F43F-22A4-4B4A-8244-2C2F7A9102DB@proofpoint.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit I think that explains it then - the server is probably dropping the new (3.3.0) "getChildren" message (xid 7) as it (3.2.2 server) doesn't know about that message type. Then the server responds to the client for a subsequent operation (xid 8), and at that point the client notices that getChildren (xid 7) got lost. Patrick On 05/12/2010 11:30 AM, Jordan Zimmerman wrote: > Oh, OK. When I get a moment I'll restart the 3.2.2 and post logs, > etc. > > Yes, we're calling getChildren with the callback. > > -JZ > > On May 12, 2010, at 11:28 AM, Patrick Hunt wrote: > >> I'm still interested though... Are you using the new "getChildren" >> api that was added to the client in 3.3.0? (it provides a Stat >> object on return, the old getChildren did not). While we don't >> officially support 3.3.0 client with 3.2.2 server (we do support >> the other way around), there shouldn't be they type of problem with >> this configuration as you describe. I'd still be interested for you >> to create that jira. >> >> Regards, >> >> Patrick >> >> On 05/12/2010 11:23 AM, Jordan Zimmerman wrote: >>> Apologies... >>> >>> I thought I was running 3.3.0 server, but was running 3.2.2 >>> server with 3.3.0 client. I upgraded the server and now all works >>> again. Sorry to trouble y'all. >>> >>> -Jordan >>> >>> On May 12, 2010, at 11:11 AM, Patrick Hunt wrote: >>> >>>> Hi Jordan, you've seen this once or frequently? (having the >>>> server + client logs will help alot) >>>> >>>> Patrick >>>> >>>> On 05/12/2010 11:08 AM, Jordan Zimmerman wrote: >>>>> Sure - if you think it's a bug. >>>>> >>>>> We were using Zookeeper without issue. I then refactored a >>>>> bunch of code and this new behavior started. I'm starting ZK >>>>> using "zkServer start" and haven't made any changes to the >>>>> code at all. >>>>> >>>>> I'll get the logs together and post a JIRA. >>>>> >>>>> -JZ >>>>> >>>>> On May 12, 2010, at 10:59 AM, Mahadev Konar wrote: >>>>> >>>>>> Hi Jordan, Can you create a jira for this? And attach all >>>>>> the server logs and client logs related to this timeline? >>>>>> How did you start up the servers? Is there some changes you >>>>>> might have made accidentatlly to the servers? >>>>>> >>>>>> >>>>>> Thanks mahadev >>>>>> >>>>>> >>>>>> On 5/12/10 10:49 AM, "Jordan >>>>>> Zimmerman" wrote: >>>>>> >>>>>>> We've just started seeing an odd error and are having >>>>>>> trouble determining the cause. "Xid out of order. Got 8 >>>>>>> expected 7" Any hints on what can cause this? Any ideas >>>>>>> on how to debug? >>>>>>> >>>>>>> We're using ZK 3.3.0. The error occurs in >>>>>>> ClientCnxn.java line 781 >>>>>>> >>>>>>> -Jordan >>>>>> >>>>> >>> >