Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 61668 invoked from network); 2 Aug 2006 20:14:17 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 2 Aug 2006 20:14:17 -0000 Received: (qmail 26616 invoked by uid 500); 2 Aug 2006 20:14:10 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 26570 invoked by uid 500); 2 Aug 2006 20:14:09 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 26530 invoked by uid 99); 2 Aug 2006 20:14:09 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Aug 2006 13:14:09 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (asf.osuosl.org: local policy) Received: from [169.229.70.167] (HELO rescomp.berkeley.edu) (169.229.70.167) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Aug 2006 13:14:09 -0700 Received: by rescomp.berkeley.edu (Postfix, from userid 1007) id C968C5B783; Wed, 2 Aug 2006 13:13:48 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by rescomp.berkeley.edu (Postfix) with ESMTP id C37937F403 for ; Wed, 2 Aug 2006 13:13:48 -0700 (PDT) Date: Wed, 2 Aug 2006 13:13:48 -0700 (PDT) From: Chris Hostetter To: java-user@lucene.apache.org Subject: Re: Filters or BooleanQuery In-Reply-To: <44CE064C.5090107@mac.com> Message-ID: References: <44CE064C.5090107@mac.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N : I could easily go through and add query's but I have a feeling I should : use a filter somewhere. What are the best practices when deciding what : to do? the list archives will have a lot of discussion on this (mainly along the lines of "should i use a Filter?" in general the deciding factors should be... 1) does the critera in question need to affect scoring -- ie: if you are searching for new articles with the word "software" and you only want articles maped to the stock tickers AAPL or IBM, do you care what the term frequency of the "stockTicker" field is for each document, or what hte document frequency for those particular tickers are? ... if so a Filter won't help you. 2) is the criteria somehting that is reused a lot, and would therfor make sense to cache independend of the users input -- ie: if a user is searching for "flash ram" products and they and lots of other users want to restict to "instock:true" products then a cached Filter could be really handy. -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org