Return-Path: X-Original-To: apmail-couchdb-dev-archive@www.apache.org Delivered-To: apmail-couchdb-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 19EA110848 for ; Wed, 27 Nov 2013 14:19:03 +0000 (UTC) Received: (qmail 10350 invoked by uid 500); 27 Nov 2013 14:19:02 -0000 Delivered-To: apmail-couchdb-dev-archive@couchdb.apache.org Received: (qmail 10230 invoked by uid 500); 27 Nov 2013 14:19:01 -0000 Mailing-List: contact dev-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@couchdb.apache.org Delivered-To: mailing list dev@couchdb.apache.org Received: (qmail 10217 invoked by uid 99); 27 Nov 2013 14:19:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Nov 2013 14:19:01 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jason.h.smith@gmail.com designates 209.85.214.176 as permitted sender) Received: from [209.85.214.176] (HELO mail-ob0-f176.google.com) (209.85.214.176) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Nov 2013 14:18:54 +0000 Received: by mail-ob0-f176.google.com with SMTP id va2so7328378obc.7 for ; Wed, 27 Nov 2013 06:18:33 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=mMZ10QIqV8xHaHVtJTr/jlvsmPuyb+CBTYdqZUuKcxY=; b=nv1pz5duNrDy3c/6P0xIbx2wClmS8STeChNviT/2Gh1MwjIAPr+sElhP+7NiBl379u f6l46gNLiJMmDc+5PdHH9lT73iGsvMo32zVSA6H31yZ/OfdnHLf+6twhw1Lol9HjTqu9 om6n82C4A/pqs7DI/VvkoNAgejD182doS9Q7JhLlnwnALxHeZabWSPOY1DyOvxZoziRt /MBFYgqHgQOLFEDkK+gCiyP4iISaIrP3hMmVDYBJneSUpNGejjs0jiGlDwo3436GHhn1 9l7rb2tI4JymQH8hW6lp2p2tfOosx3NGIh61EO64Mpr2Z0pG1dMIrbbvlkjO9zuY7wV3 Glgw== X-Received: by 10.60.45.65 with SMTP id k1mr6176245oem.48.1385561913805; Wed, 27 Nov 2013 06:18:33 -0800 (PST) MIME-Version: 1.0 Received: by 10.182.128.42 with HTTP; Wed, 27 Nov 2013 06:18:13 -0800 (PST) In-Reply-To: References: From: Jason Smith Date: Wed, 27 Nov 2013 21:18:13 +0700 Message-ID: Subject: Re: NPM, CouchDB and big attachments To: "dev@couchdb.apache.org" Content-Type: multipart/alternative; boundary=089e0141a79cafd38e04ec29456c X-Virus-Checked: Checked by ClamAV on apache.org --089e0141a79cafd38e04ec29456c Content-Type: text/plain; charset=UTF-8 Yes, I think it is quite likely that an alternative replicator comes out soon. This replicator can be tailored to npm's needs, such as only replicating a subset of the documents[1] ("Replicate only package X, Y, and Z plus their dependencies). Or possibly even only replicating a subset of that document's attachments. Who knows? [1]: Yes replication filters can do that, but the npm filter policy depends on relationships between different documents, so it's difficult to shoehorn that into the current filter system. Perhaps the current filter will be used as a component in a broader tool. On Wed, Nov 27, 2013 at 9:14 PM, Robert Newson wrote: > "replicate 100GB (and growing) data just to have a local npm web service." > > This makes no sense to me. Of course you have to have a local copy of > all the things to have a "local npm web service". Are we talking about > partial mirroring instead then? If so, filtered replication has you > covered, right? If you just want a local mirror of the things you use, > then pull through a caching proxy (squid3 is nice). > > B. > > > On 27 November 2013 14:07, Jason Smith wrote: > > I am on-record as supportive of npm's needs here but still skeptical or > at > > least unclear on the fix. > > > > Pain points: > > > > 1. Replicating the registry can have problems. Best-case scenario you > have > > to download 100GB of data (and growing exponentially) > > > > 2. Bandwidth costs of the primary service > > > > Both problems are aggravated by the need for legacy support. CouchDB 1.5 > > has replication fixes but it is still new. And the npm clients will want > to > > fetch attachments from the couch server even after a newer CDN-using one > > comes out. > > > > I am conflicted. I agree with the point that you shouldn't have to > > replicate 100GB (and growing) data just to have a local npm web service. > > But I don't want to modify replication or the HTTP lightly. > > > > On Wed, Nov 27, 2013 at 2:14 PM, Alexander Shorin > wrote: > > > >> http://blog.nodejs.org/2013/11/26/npm-post-mortem/ > >> > >> > Move attachments out of CouchDB: Work has begun to move the package > >> tarballs out of > >> > CouchDB and into Joyent's Manta service. Additionally, MaxCDN has > >> generously offered to > >> > provide CDN services for npm, once the tarballs are moved out of the > >> registry database. > >> > This will help improve delivery speed, while dramatically reducing the > >> file system I/O load on > >> > the CouchDB servers. Work is progressing slowly, because at each stage > >> in the plan, we are > >> > making sure that current replication users are minimally impacted. > >> > >> I wonder is it CouchDB non-optimal I/O and/or can 769 issue fix it? > >> > >> https://issues.apache.org/jira/browse/COUCHDB-769 > >> > >> There is alpha-patch attached. May be it's good time to push it > >> forward? What things are left for it? > >> > >> -- > >> ,,,^..^,,, > >> > --089e0141a79cafd38e04ec29456c--