From: "Chris Anderson"
To: couchdb-user@incubator.apache.org
Date: Fri, 11 Apr 2008 00:26:36 -0700
Subject: Re: Clarifications or bugs

On Thu, Apr 10, 2008 at 11:31 PM, Søren Hilmer wrote:
> My problem is that couchdb only tells me which database has changed, with
> that information I can get the documents changed, but now I need to run
> the views to index on these documents (and preferably these documents
> only) in order to get what I need to index. I believed I could filter on
> the documentid (startkey_docid) but that is apparently only supported in
> conjunction with startkey (which I do not know, as I haven't ran the view
> on the document yet).

I'm afraid this is significantly different from the architecture as it
currently stands, but if the indexer were to be called with the output of
each emit() function from the view engine, it would clear up a lot of
these problems.
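
To make the idea concrete, here is a rough Python sketch of an indexer that is fed one record per emit(). It is only an illustration under stated assumptions: the newline-delimited JSON format and the field names "view", "key", "value" and "doc" are hypothetical, not anything CouchDB actually sends, and EmitIndexer and run are made-up names.

```python
import io
import json
import re
from collections import defaultdict

TOKEN = re.compile(r"[a-z0-9]+")


class EmitIndexer:
    """Toy inverted index fed one record per emit() (hypothetical protocol)."""

    def __init__(self):
        # term -> set of (doc_id, JSON-encoded view key) pairs
        self.postings = defaultdict(set)

    def handle_emit(self, record):
        # Assumed record shape: {"view": ..., "key": ..., "value": ..., "doc": {...}}
        doc_id = record["doc"]["_id"]
        key = json.dumps(record["key"])
        value = record["value"]
        text = value if isinstance(value, str) else json.dumps(value)
        for term in TOKEN.findall(text.lower()):
            # Keying on (doc_id, key) keeps separate entries when one
            # document is emitted several times by the same view.
            self.postings[term].add((doc_id, key))

    def search(self, term):
        return sorted(self.postings.get(term.lower(), set()))


def run(stream, indexer):
    """Read one JSON record per line from stream and index each one."""
    for line in stream:
        line = line.strip()
        if line:
            indexer.handle_emit(json.loads(line))


if __name__ == "__main__":
    # Two emits for the same document, as a view might produce.
    sample = io.StringIO(
        '{"view": "by_tag", "key": "a", "value": "Hello world", "doc": {"_id": "d1"}}\n'
        '{"view": "by_tag", "key": "b", "value": "hello again", "doc": {"_id": "d1"}}\n'
    )
    indexer = EmitIndexer()
    run(sample, indexer)
    print(indexer.search("hello"))
```

In a real setup the stream would be the process's standard input, for example run(sys.stdin, indexer), with whatever framing the server and indexer agree on.
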
Since the view engine is already external to CouchDB and communicating
over stdio, it seems like a decent opportunity to attach other processes
as well. CouchDB could be set to send the result of each view (at least
those that are configured to be full-text searchable) along with its
corresponding document to the full-text indexer, at view-compute time.

The full-text indexer would be called once per emit, so it could easily
handle cases where a given document shows up in the same view multiple
times. To me this seems simpler than trying to pick up changes that could
be anywhere in the view sort order.

The gotcha may come when the view-engines are distributed across
instances. Then the problem becomes managing a distributed full-text
indexing process...

Hopefully this is helpful.

Chris

--
Chris Anderson
http://jchris.mfdz.com