Date: Wed, 4 Aug 2010 10:38:08 -0700
Subject: Re: CouchDB Replication Failure - odd db_not_found errors
From: Chris Stockton
To: user@couchdb.apache.org

Hello,

On Mon, Aug 2, 2010 at 5:49 PM, Chris Stockton wrote:
> I am having a strange issue that I am stuck and unsure how to further

For anyone who gets similar errors: I was unable to find the root cause for this. It seemed to happen when I fired up our replicator, which spins up about 4 thousand replication jobs. My only thought is that the server was overloaded.

I ended up deleting all dbs, reinstalling couchdb, deleting all lib files, etc. Basically a fresh couchdb build. I then replicated the databases one by one. Before I went home last night I spun up the replicator, which basically does status checks and makes sure continuous replication is running on all machines. Despite hours of errors, they dwindled down until all databases were replicating correctly.

Moral of the story: if replication isn't working and you see similar errors, the fix might be to wipe all databases, then do a single-pass replication (or rsync) of all databases before firing up continuous replication.

-Chris
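
For illustration only, here is a rough sketch of the "single pass first, continuous afterwards" order described above. It is not the replicator mentioned in this message. It assumes CouchDB's `POST /_replicate` endpoint with `source`, `target` and `continuous` fields, and that the target databases already exist. The function names and the server URLs in the usage note are made up for the example.

```python
import json
import urllib.request


def replication_body(source_url, target_url, db, continuous=False):
    """Build the JSON body for a POST /_replicate request (hypothetical helper)."""
    body = {"source": f"{source_url}/{db}", "target": f"{target_url}/{db}"}
    if continuous:
        body["continuous"] = True
    return body


def post_replicate(target_url, body):
    """POST the replication request to the target server and return the parsed reply."""
    req = urllib.request.Request(
        f"{target_url}/_replicate",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def seed_then_follow(source_url, target_url, dbs, post=post_replicate):
    """Replicate each database once, one at a time, then start continuous replication.

    Returns a list of (db, error) pairs from the one-shot pass. Continuous
    replication is only started when that list is empty.
    """
    failed = []
    for db in dbs:
        try:
            # A non-continuous request returns once that replication has finished,
            # so only one job is in flight at any time.
            post(target_url, replication_body(source_url, target_url, db))
        except Exception as exc:
            failed.append((db, str(exc)))
    if failed:
        return failed
    for db in dbs:
        post(target_url, replication_body(source_url, target_url, db, continuous=True))
    return failed
```

Usage would look something like `seed_then_follow("http://src.example:5984", "http://dst.example:5984", ["db1", "db2"])`, with placeholder hosts. The `post` argument is only there so the ordering can be checked without a running server.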