Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 29314 invoked from network); 17 May 2006 23:42:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 17 May 2006 23:42:56 -0000 Received: (qmail 95617 invoked by uid 500); 17 May 2006 23:42:53 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 95577 invoked by uid 500); 17 May 2006 23:42:53 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 95566 invoked by uid 99); 17 May 2006 23:42:53 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 May 2006 16:42:53 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (asf.osuosl.org: local policy) Received: from [209.86.89.68] (HELO smtpauth08.mail.atl.earthlink.net) (209.86.89.68) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 May 2006 16:42:52 -0700 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk20050327; d=ix.netcom.com; b=tIaWdX1BqnKgt4iqjvsVTCa21jB6sQKpIzdM4ivoX4CDqi8SEquDLw9S9VnKR/ef; h=Received:Reply-To:From:To:Subject:Date:Message-ID:MIME-Version:Content-Type:Content-Transfer-Encoding:X-Priority:X-MSMail-Priority:X-Mailer:In-Reply-To:X-MimeOLE:Importance:X-ELNK-Trace:X-Originating-IP; Received: from [66.245.135.50] (helo=ENGELSSERVER) by smtpauth08.mail.atl.earthlink.net with asmtp (Exim 4.34) id 1FgVew-0007Z2-Nl for java-dev@lucene.apache.org; Wed, 17 May 2006 19:42:30 -0400 Reply-To: From: "Robert Engels" To: Subject: RE: non indexed field searching? Date: Wed, 17 May 2006 18:42:33 -0500 Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0) In-Reply-To: X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.2869 Importance: Normal X-ELNK-Trace: 33cbdd8ed9881ca8776432462e451d7bd15d05d9470ff7107f6512ed01eacb2a7dc220879d05eaee350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c X-Originating-IP: 66.245.135.50 X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Having an indexed-field seems to occur significant overhead when merging, and if the index is highly interactive, the merging process occurs quite often. Maybe I am incorrect regarding the overhead of indexed fields? I have attempted to keep the number of indexed fields to a minimum. I think it boils down to whether a being able to do a range query (for date filtering as an example) is worth the cost of maintaining that index. If the other terms are mildly rare, then inspecting the documents to match against the needed range seems more efficient (thus the need to turn Filter into an interface). But if the term they are looking for is common, then the date range would be needed (to avoid a scan of all documents matching the term). It may just be that all fields need to be indexed in order to cover all cases (and that the cost of doing a range filter on a indexed field is far less in ALL cases than inspecting any documents). -----Original Message----- From: Erik Hatcher [mailto:erik@ehatchersolutions.com] Sent: Wednesday, May 17, 2006 6:19 PM To: java-dev@lucene.apache.org Subject: Re: non indexed field searching? On May 17, 2006, at 11:20 AM, Robert Engels wrote: > I reviewed the solr source (at LOT of the code is amazingly similar > to our > own search server). > > I don't see anything related to searching using non-indexed fields. > Could > you maybe point me at the class(es) that implement this functionality? Sorry, I missed the "non" part of "non-indexed fields". I don't quite understand why you wouldn't just index every field if you needed that capability though. Erik > > -----Original Message----- > From: Erik Hatcher [mailto:erik@ehatchersolutions.com] > Sent: Tuesday, May 16, 2006 6:35 PM > To: java-dev@lucene.apache.org > Subject: Re: non indexed field searching? > > > > On May 16, 2006, at 3:37 PM, Robert Engels wrote: >> It seems that maybe a query could be separated into Filter and >> Query clauses >> (similar to how the query optimizer works in Nutch). Clauses that >> were based >> on non-indexed fields would be converted to a Filter. >> >> The problem is if you have some thing like >> >> (indexed:somevalue OR nonindexed:somevalue) >> >> would require a complete visit to every document. > > Not necessarily. A query optimizer could could extract these term > query clauses, look up cached doc sets (bit sets) and union them. > Scoring is the trickier part - I'm now curious to dig into Solr and > see how it handles this. > >> I understand that this is moving Lucene closer to a database, but >> it is just >> very difficult to perform some complex queries efficiently without >> it. > > Check out Solr - I think you'll find it fits this niche nicely. > >> *** As an aside, I still don't understand why Filter is not an >> interface > > I saw that Paul Elschot has just done some refactoring work attached > to a JIRA issue on this very topic. > > Erik > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-dev-help@lucene.apache.org > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-dev-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org