Return-Path: Delivered-To: apmail-incubator-couchdb-dev-archive@locus.apache.org Received: (qmail 68515 invoked from network); 13 Nov 2008 22:06:37 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 13 Nov 2008 22:06:37 -0000 Received: (qmail 78374 invoked by uid 500); 13 Nov 2008 22:06:44 -0000 Delivered-To: apmail-incubator-couchdb-dev-archive@incubator.apache.org Received: (qmail 78329 invoked by uid 500); 13 Nov 2008 22:06:44 -0000 Mailing-List: contact couchdb-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: couchdb-dev@incubator.apache.org Delivered-To: mailing list couchdb-dev@incubator.apache.org Received: (qmail 78318 invoked by uid 99); 13 Nov 2008 22:06:44 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Nov 2008 14:06:43 -0800 X-ASF-Spam-Status: No, hits=-1998.8 required=10.0 tests=ALL_TRUSTED,FS_REPLICA X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Nov 2008 22:05:30 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 45C50234C28A for ; Thu, 13 Nov 2008 14:05:44 -0800 (PST) Message-ID: <1338269458.1226613944284.JavaMail.jira@brutus> Date: Thu, 13 Nov 2008 14:05:44 -0800 (PST) From: "Adam Kocoloski (JIRA)" To: couchdb-dev@incubator.apache.org Subject: [jira] Updated: (COUCHDB-160) replication performance improvements In-Reply-To: <1964590641.1226429624321.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/COUCHDB-160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Kocoloski updated COUCHDB-160: ----------------------------------- Attachment: couch_rep_v3.diff forgot to increase the pipeline length in the last version. The goal is for the entire replication to use 1 persistent connection. > replication performance improvements > ------------------------------------ > > Key: COUCHDB-160 > URL: https://issues.apache.org/jira/browse/COUCHDB-160 > Project: CouchDB > Issue Type: Improvement > Components: Database Core > Affects Versions: 0.9 > Reporter: Adam Kocoloski > Priority: Minor > Attachments: couch_rep.erl.diff, couch_rep_v2.diff, couch_rep_v3.diff > > > I wrote some code to speed up CouchDB's replication process by parallelizing document requests and using _bulk_docs to write changes to the target. I tested the speedup as follows: > * 1000 document DB, 1022 update_seq, ~450 KB after compaction > * local and remote machines have ~45 ms latency > * timed requests using timer:tc(couch_rep, replicate, [<<"source">>, <<"target">>] > * all replications are "from scratch" > trunk: > local-local 115 > local-remote 145 > remote-remote 173 > remote-local 146 > db size after replication: 1.8 MB > patch: > local-local 1.83 > local-remote 38 > remote-remote 64 > remote-local 35 > db size after replication: 453 KB > I'll attach the patch as an update to this issue. It might be worth exposing the "batch size" (currently 100 docs) as a configurable parameter. Comments welcome. Best, > Adam -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.