Return-Path: Delivered-To: apmail-couchdb-dev-archive@www.apache.org Received: (qmail 24923 invoked from network); 12 Feb 2011 22:22:30 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 12 Feb 2011 22:22:30 -0000 Received: (qmail 80178 invoked by uid 500); 12 Feb 2011 22:22:30 -0000 Delivered-To: apmail-couchdb-dev-archive@couchdb.apache.org Received: (qmail 80010 invoked by uid 500); 12 Feb 2011 22:22:29 -0000 Mailing-List: contact dev-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@couchdb.apache.org Delivered-To: mailing list dev@couchdb.apache.org Received: (qmail 80002 invoked by uid 99); 12 Feb 2011 22:22:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 12 Feb 2011 22:22:29 +0000 X-ASF-Spam-Status: No, hits=2.9 required=5.0 tests=FREEMAIL_FROM,FS_REPLICA,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of fdmanana@gmail.com designates 209.85.161.52 as permitted sender) Received: from [209.85.161.52] (HELO mail-fx0-f52.google.com) (209.85.161.52) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 12 Feb 2011 22:22:24 +0000 Received: by fxm5 with SMTP id 5so3892559fxm.11 for ; Sat, 12 Feb 2011 14:22:02 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; bh=SynRKCe2CDhVSmo2bZf94MTuHDo8RIDhrv5GvAmMdIc=; b=kz3NnTFUYPMJK0/MG/lQlFmiva+vsOrAs+vyTUyDlpvC1StrMB6Fgl4RyBulN9t2Gq F+VLpEdXdYFmHuUAIrUvZVYf+crLy7n3KgvPO3wOhhLmNwMfs4VchQqAltpyPEkehwwa 5u3yDXwi6NZ+ubdBLqiLUGPkxk+YuODlso0v8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; b=vaEEefBQ+PzkYVeY+jdkfVeTkwhkFvWEhUdeEq7/XPfwbcqOWKYNkgrovZHwJc10m3 rer5UML5RncUuqxGBvH9bWHFD8KkPCQGPXYpSKd4X+AjqAO1Sua8L89TBv5yy69gt/oS MEfso6AoDQoZl6EPwbSV8btdskSjcKhsBID5s= MIME-Version: 1.0 Received: by 10.223.104.147 with SMTP id p19mr2425636fao.6.1297549321927; Sat, 12 Feb 2011 14:22:01 -0800 (PST) Sender: fdmanana@gmail.com Received: by 10.223.124.139 with HTTP; Sat, 12 Feb 2011 14:22:01 -0800 (PST) In-Reply-To: References: Date: Sat, 12 Feb 2011 14:22:01 -0800 X-Google-Sender-Auth: HueHM8TGj5y80h9utu0Jz7FnOGQ Message-ID: Subject: Re: 1.0.2 regression in continuous replication From: Filipe David Manana To: dev@couchdb.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Dirkjan, What's the time delta between the moment the replication started and the moment you get the changes_timeout error? Is it a delta of about 30 seconds? You can check when the replication started by inspecting the log file for a line like this: [Sat, 12 Feb 2011 22:16:10 GMT] [info] [<0.108.0>] starting new replication "bae147260ef23b1d062188e6838d68d7" at <0.192.0> The relevant error message in your log is the third one: "[Tue, 01 Feb 2011 06:23:54 GMT] [error] [<0.3701.3>] changes loop timeout, no data received from http://10.8.0.1:5984/efp/" I want to know the timestamp difference between both. regards, On Fri, Feb 4, 2011 at 6:26 AM, Dirkjan Ochtman wrote: > Hi everyone, > > I asked about this a bit on IRC, but we decided that this might be > better on the mailing list. > > At work, we use CouchDB continuous replication for some important > streams, from a server in a data center somewhere to the servers in > our office, over OpenVPN. This was partly done because the > consumer-grade internet in our office (hey, it's a startup) is a > little flaky at times. This used to work fine; we had a script to > restart the replication if it failed, but that only happened once in a > week or a few weeks. > > However, since installing 1.0.2 on Monday, we see our continuous > replication failing much more often, like 3-5 times per day. It seems > to fail complaining about some timeout, here's a trace: > http://dirkjan.ochtman.nl/files/repl-fail.log. > > Is anyone seeing similar behavior? Are there any changes in CouchDB > between 1.0.1 and 1.0.2 that could have caused this? We're running > fairly standard Linux boxes, not much else changed on Monday. > > Cheers, > > Dirkjan > --=20 Filipe David Manana, fdmanana@gmail.com, fdmanana@apache.org "Reasonable men adapt themselves to the world. =C2=A0Unreasonable men adapt the world to themselves. =C2=A0That's why all progress depends on unreasonable men."