Return-Path: Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: (qmail 79654 invoked from network); 5 Jun 2010 21:09:59 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 5 Jun 2010 21:09:59 -0000 Received: (qmail 67462 invoked by uid 500); 5 Jun 2010 21:09:58 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 67422 invoked by uid 500); 5 Jun 2010 21:09:58 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 67414 invoked by uid 99); 5 Jun 2010 21:09:58 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Jun 2010 21:09:58 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [145.58.30.185] (HELO out1b.mail.omroep.nl) (145.58.30.185) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Jun 2010 21:09:48 +0000 Received: from localhost (ou1bclean [10.10.30.159]) by out1b.mail.omroep.nl (Postfix MTA - NPO ICT) with ESMTP id 3E67A3000119 for ; Sat, 5 Jun 2010 23:09:28 +0200 (CEST) X-Virus-Scanned: NPO ICT Received: from tweehoog.vpro.nl (tweehoog.vpro.nl [145.58.169.4]) by out1b.mail.omroep.nl (Postfix MTA - NPO ICT) with ESMTP id 1D5C730000B0 for ; Sat, 5 Jun 2010 23:09:28 +0200 (CEST) Received: from exmail.vpro.nl ([145.58.171.81] helo=VS-EX-01.intra.vpro.nl) by tweehoog.vpro.nl with esmtp (Exim 3.36 #1) id 1OL0cS-0008ME-00 for user@couchdb.apache.org; Sat, 05 Jun 2010 23:09:28 +0200 Received: from VS-EX-01.intra.vpro.nl ([145.58.171.81]) by VS-EX-01.intra.vpro.nl ([145.58.171.81]) with mapi; Sat, 5 Jun 2010 23:09:27 +0200 From: Nils Breunese To: "user@couchdb.apache.org" Date: Sat, 5 Jun 2010 23:09:26 +0200 Subject: RE: clucene and couchdb Thread-Topic: clucene and couchdb Thread-Index: AcsELVriOHWy9L0VSGSM1HN+7UthHQAxSADo Message-ID: References: In-Reply-To: Accept-Language: nl-NL Content-Language: nl-NL X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: nl-NL Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org We had some serious performance problems with couchdb-lucene on a busy site= recently. It turned out the problem wasn't couchdb-lucene itself (queries = were fast!), but the fact that communication between CouchDB and external p= rocesses use stdout/stdin, which AFAIK doesn't allow for concurrency. This = turned out to be a major bottleneck in our setup. We're currently setting u= p caching for couchdb-lucene URL's hoping this will help. We even tried red= irecting traffic for couchdb-lucene URL's directly to couchdb-lucene, thus = avoiding the stdin/stdout serialization, but apperently the current stable = release of couchdb-lucene doesn't handle concurrency well yet (I believe rn= ewsom already fixed some bugs in that area), but it least it has the potent= ial to do so. Let us know when you have some numbers of couchdb-clucene versus couchdb-lu= cene. (We're mainly a Java shop though, so we're not afraid of running JVM'= s.) Nils. ________________________________________ Van: Norman Barker [norman.barker@gmail.com] Verzonden: vrijdag 4 juni 2010 23:31 Aan: user@couchdb.apache.org Onderwerp: clucene and couchdb Hi, I am writing a clucene indexer for CouchDB, I have update_notifications and _fti as a db handler working. I am using stdout/stdin for the communication and it is looking good. Looking at http://wiki.apache.org/couchdb/Full_text_search I see that the index property in the design document is a javascript function and I am wondering why? For views I can understand why you would want to do an evaluation but for Lucene could we just use a JSON Path reference? Thoughts appreciated, since I am in C++ and SpiderMonkey is available I could do an eval of the javascript, but it might be easier just to parse the JSON path. We will be putting this CLucene implementation in the public domain once I have cleared the necessary internal paperwork. CLucene is dual license (Apache and LGPL) and I am using Cajun (BSD) for the JSON parsing so should I host this separately or take out a JIRA ticket to have it included in CouchDB? thanks, Norman De informatie vervat in deze e-mail en meegezonden bijlagen is uitsluitend= bedoeld voor gebruik door de geadresseerde en kan vertrouwelijke informati= e bevatten. Openbaarmaking, vermenigvuldiging, verspreiding en/of verstrekk= ing van deze informatie aan derden is voorbehouden aan geadresseerde. De VP= RO staat niet in voor de juiste en volledige overbrenging van de inhoud van= een verzonden e-mail, noch voor tijdige ontvangst daarvan.