Date: Tue, 17 Nov 2015 10:10:24 -0800
Subject: Re: Transaction timeouts
From: Raúl Gutiérrez Segalés
To: "user@zookeeper.apache.org"

Hi,

On 17 November 2015 at 05:10, Akmal Abbasov wrote:

> Hi, I’m seeing a lot of `Closing connection to peer due to transaction
> timeout` messages in zk logs, in all zk servers.
> Is this transaction timeout configured through syncLimit in zk config file.

That message comes from LearnerHandler#ping() [0], and the frequency of
pings from the leader to learners is twice a tick [1]. So if your tickTime
is 2000ms (the default), you are pinging the learners every second. You
could adjust the tickTime and see if it gets better.

But I suspect something else (GC-ing? noisy network?) is going on, given
that it shouldn't be that hard for the leader and learners to keep up with
1 ping every sec. You can check ZAB messages (i.e.: pings, acks, commits,
proposals, etc.) between the leader and learners using zktraffic's
zk-dump [2].

> Also does zk server need to be restarted in order to update this config?

yes.

-rgs

[0] https://github.com/apache/zookeeper/blob/trunk/src/java/main/org/apache/zookeeper/server/quorum/LearnerHandler.java#L923
[1] https://github.com/apache/zookeeper/blob/trunk/src/java/main/org/apache/zookeeper/server/quorum/Leader.java#L549
[2] https://github.com/twitter/zktraffic
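
For reference, the ping arithmetic above can be sketched in a few lines of Python. This is only an illustration, not ZooKeeper code: the function names are made up here, and the second helper assumes that the window a learner gets before the leader gives up is tickTime * syncLimit, which is how I read the syncLimit setting but is not something confirmed in this thread. The syncLimit value of 5 in the example is likewise just an example value.

```python
def ping_interval_ms(tick_time_ms: int) -> float:
    """Leader pings learners twice per tick, so the interval is half a tick."""
    return tick_time_ms / 2


def sync_window_ms(tick_time_ms: int, sync_limit: int) -> int:
    """ASSUMPTION: the timeout window is tickTime * syncLimit (in ms).

    This is an illustrative reading of the syncLimit setting,
    not taken from the message above.
    """
    return tick_time_ms * sync_limit


if __name__ == "__main__":
    tick_time = 2000   # default tickTime mentioned above, in ms
    sync_limit = 5     # example value only
    print("ping every", ping_interval_ms(tick_time), "ms")
    print("assumed sync window", sync_window_ms(tick_time, sync_limit), "ms")
```

With tickTime=2000 this gives one ping per second, matching the figure above; raising tickTime stretches both the ping interval and (under the stated assumption) the window in which a learner has to respond.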