Return-Path: Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: (qmail 33801 invoked from network); 29 Mar 2011 14:09:32 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 29 Mar 2011 14:09:32 -0000 Received: (qmail 92294 invoked by uid 500); 29 Mar 2011 14:09:30 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 92260 invoked by uid 500); 29 Mar 2011 14:09:30 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 92252 invoked by uid 99); 29 Mar 2011 14:09:30 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 29 Mar 2011 14:09:30 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of norman.barker@gmail.com designates 74.125.82.54 as permitted sender) Received: from [74.125.82.54] (HELO mail-ww0-f54.google.com) (74.125.82.54) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 29 Mar 2011 14:09:26 +0000 Received: by wwd20 with SMTP id 20so215279wwd.23 for ; Tue, 29 Mar 2011 07:09:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=kd4vTdwv+dFbbGXlEMOxIofsh1J3Sct1YuII055QAYY=; b=U4WAycddijvugl6YhGD7G3SxCgJqsvZWeq99oqyOkVmxFVwzWk6wBymov+L/vADPwL jDy4Jgdr/xuYgBM0Ft4ouBzu1tuPn4k6lDTfN9pilzhvuC6q5Ad2mnSxfqwPZNE38Ehl Yc6CII07EBLHyQvs2/Ns9m1GNOLoBQQoqW4a8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=R07qi3oRYa5b62Mx6NjY0GKAG7cBmiwqrUi1CwGUg8/QMMRIbC+Sveqp8VIg1XXKVS ++ywigQF/Q6pxAlp/4aR7zM7oOhN7HW9nl8BEDi3yNe8k92TL6x39fr2Ql+13hh9BKyZ /sKtYNDUbAJQrTqq8p8H7blDQm8e803/LhOuM= MIME-Version: 1.0 Received: by 10.216.44.208 with SMTP id n58mr4505118web.39.1301407744409; Tue, 29 Mar 2011 07:09:04 -0700 (PDT) Received: by 10.216.196.9 with HTTP; Tue, 29 Mar 2011 07:09:03 -0700 (PDT) In-Reply-To: References: <41564f51-bb8a-4fbe-a984-941d19852c06@HUB25.4emm.local> Date: Tue, 29 Mar 2011 08:09:03 -0600 Message-ID: Subject: Re: Full text search - is it coming? If yes, approx when. From: Norman Barker To: user@couchdb.apache.org Content-Type: text/plain; charset=ISO-8859-1 Benoit, interesting post on Lucy, I have been monitoring that as well (and though no where near as good as Robert's work) I have integrated clucene and couchdb as I was looking for a solution that didn't use Java. I see a trend with couchdb and NIFs, what is the official standpoint here, test and test the c / c++ library so that any chance of bringing the VM down is reduced? I know with Java and JNI in an app server you are taking a huge risk (heartbeat works, but an app server takes several minutes to start up), with Erlang are you relying on the heartbeat service to restart the VM in case of failure? I am interested in helping with any NIF on top of Lucy. thanks, Norman On Tue, Mar 29, 2011 at 7:58 AM, Simon Metson wrote: > Does http://blog.cloudant.com/developer-preview-cloudant-search-for-couchdb/ help wrt. the original post? Cloudant's search is built on Lucene. > Cheers > Simon > > Sent with Sparrow > On Tuesday, 29 March 2011 at 14:24, Dennis Geurts wrote: > Hi all, >> >> Looking at the amount of replies wrt to this topic it seems there's much interest in full text searching. >> >> It's really hard to tell how one would expect this feature to be implemented in couchdb in such a way that it would supersede the nice couchdb-lucene combo. >> >> That said, if you want a _really simple_ (and probably bad solution performance wise!) fulltext search implementation, have a look at couchdb lists. >> >> You decide which _view is sent to the _list function; within the _list function you can implement your full text search by inspecting the document data in javascript. >> >> This setup at least allows for replication of the fts functionality and might be just enough for the OP. >> >> >> >> Cheers, dennis >> >> >> >> ----- Reply message ----- >> From: "Zdravko Gligic" >> Date: Tue, Mar 29, 2011 13:49 >> Subject: Full text search - is it coming? If yes, approx when. >> To: "user@couchdb.apache.org" >> >> I have a bit tricky use case of super tagging or rather a somewhat >> hierarchical docs categorization. Several CouchDB gurus have suggested >> that I should look at Lucene and such. My problem is hosting because >> I would most rather go with a cloud solution such as Cloudant and >> forthcoming (I hope it's still forthcoming) CouchBase. Comparatively, >> I have very little amount of data - large number of tiny docs that are >> indexed every which way possible - such that the size of views dwarfs >> the size of docs. >> >> The full-text-searching problem is best illustrated by the >> full-text-searching hosting state of affairs at Cloudant and CouchBase >> - the only two commercial companies worth mentioning within the >> CouchDb community. Neither one uses Lucene out of the box and only >> Cloudant has their own solution. This means that I could not use a >> redundancy-performance perfect Master-Master replication that is >> hosted by both. This is why either full-text-searching needs to >> become CouchDb's internal first citizen or our hosting friends need to >> internalize and make Lucene their first class citizen. >> >> P.S. I love both but ... >> >