Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 720 invoked from network); 4 Jan 2011 18:26:28 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Jan 2011 18:26:28 -0000 Received: (qmail 4944 invoked by uid 500); 4 Jan 2011 18:26:26 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 4897 invoked by uid 500); 4 Jan 2011 18:26:26 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 4881 invoked by uid 99); 4 Jan 2011 18:26:24 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Jan 2011 18:26:24 +0000 X-ASF-Spam-Status: No, hits=1.5 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of adam.saltiel@gmail.com designates 209.85.161.48 as permitted sender) Received: from [209.85.161.48] (HELO mail-fx0-f48.google.com) (209.85.161.48) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Jan 2011 18:26:18 +0000 Received: by fxm2 with SMTP id 2so13893777fxm.35 for ; Tue, 04 Jan 2011 10:25:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=B47t+ac9CfutTeUJyDfZ/R4wRB566AaWtqRUcSoRxaI=; b=Se4nZ7k9QDB4UwdHU9dHD0cOTG24f1T6TPOikDahtfJfeORSdoxLneCm2NxDOFz5Tq Y8jD+SCg6p37XPwd3QRars2g+qZ1KHJ3Qm/J9FmsOqPLIXUHT4S69M8AFZAe8a2BdJ7U TNVxzk2KvATWlVylOJgrL6FLGNa453ZfbRo9o= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=TATU/mvfVgpQwMeRuikKxtHbVjat+u5Der5hbbwydBoDik38HoxodtArIuMoA38mIh dJQT0WzICy8I3Ju/35iCpG3qFvfp+BqYPhLZaXSwjQ2XIhStIfaGhWCGCVgxlSxIU99b 4deqxehgzt9Twkrgs9bOtDrzJB7D94lELYekg= MIME-Version: 1.0 Received: by 10.223.79.68 with SMTP id o4mr4419348fak.0.1294165557316; Tue, 04 Jan 2011 10:25:57 -0800 (PST) Received: by 10.223.78.198 with HTTP; Tue, 4 Jan 2011 10:25:57 -0800 (PST) In-Reply-To: References: <1596394141-1293637049-cardhu_decombobulator_blackberry.rim.net-1951755217-@b15.c6.bise7.blackberry> <37455700-1293642926-cardhu_decombobulator_blackberry.rim.net-1955853008-@b15.c6.bise7.blackberry> <499C43A3-01DA-4099-AB23-D1941FCC639A@apache.org> Date: Tue, 4 Jan 2011 18:25:57 +0000 Message-ID: Subject: Re: Using Lucene to search live, being-edited documents From: adasal To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=20cf3054a56d530e7304990966fe --20cf3054a56d530e7304990966fe Content-Type: text/plain; charset=UTF-8 I would think this is more like it. But the essential thing, so it seems to me, is whether there is a requirement for a serialised index, i.e. a more permanent record, aside from the saved document. Then, if there is a penalty to creating the index compared to regex, stringsearch or so, it is justified on other grounds. I think it is an interesting q. when does that requirement emerge? There is size of document. But there would also be field types. I think I have this right. This is really a classification system, so more than bare regex. There must be other criteria that apply to this use case, too? Adam p.s. we (in my work project) are just beginning to use Lucene for geometry objects and I am looking forward to understanding its use better, including, possibly, expanding it to other use cases apart from geo objects. On 3 January 2011 15:31, Robert Muir wrote: > On Mon, Jan 3, 2011 at 10:16 AM, Grant Ingersoll > wrote: > > There is also the MemoryIndex, which is in contrib and is designed for > one document at a time. That being said, basic grep/regex is probably fast > enough. > > > > In cases where you are doing a 'find' in a document similar to what a > wordprocessor would do (especially if you want to iterate > forwards/backwards through matches etc), you might want to consider > something like > http://icu-project.org/apiref/icu4j/com/ibm/icu/text/StringSearch.html > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --20cf3054a56d530e7304990966fe--