Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 33481 invoked from network); 21 Jan 2004 14:31:58 -0000 Received: from daedalus.apache.org (HELO mail.apache.org) (208.185.179.12) by minotaur-2.apache.org with SMTP; 21 Jan 2004 14:31:58 -0000 Received: (qmail 67908 invoked by uid 500); 21 Jan 2004 14:31:49 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 67888 invoked by uid 500); 21 Jan 2004 14:31:49 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 67875 invoked from network); 21 Jan 2004 14:31:49 -0000 Received: from unknown (HELO c000.snv.cp.net) (209.228.32.64) by daedalus.apache.org with SMTP; 21 Jan 2004 14:31:49 -0000 Received: (cpmta 28614 invoked from network); 21 Jan 2004 06:31:50 -0800 Received: from 128.143.104.168 (HELO ?128.143.104.168?) by smtp.hatcher.net (209.228.32.64) with SMTP; 21 Jan 2004 06:31:50 -0800 X-Sent: 21 Jan 2004 14:31:50 GMT Mime-Version: 1.0 (Apple Message framework v609) In-Reply-To: <023701c3df69$44b99900$0301a8c0@POWERPACK> References: <023701c3df69$44b99900$0301a8c0@POWERPACK> Content-Type: text/plain; charset=US-ASCII; format=flowed Message-Id: <8AABEFC7-4C1E-11D8-8953-000393A564E6@ehatchersolutions.com> Content-Transfer-Encoding: 7bit From: Erik Hatcher Subject: Re: Query Term Questions Date: Wed, 21 Jan 2004 09:31:46 -0500 To: "Lucene Users List" X-Mailer: Apple Mail (2.609) X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N On Jan 20, 2004, at 10:22 AM, Terry Steichen wrote: > 1) Is there a way to set the query boost factor depending not on the > presence of a term, but on the presence of two specific terms? For > example, I may want to boost the relevance of a document that contains > both "iraq" and "clerics", but not boost the relevance of documents > that contain only one or the other terms. (The idea is better > discrimination than if I simply boosted both terms.) But doesn't the query itself take this into account? If there are multiple matching terms then the overlap (coord) factor kicks in. > 2) Is it possible to apply (or simulate) a negative query boost > factor? For example, I may have a complex query with lots of terms > but want to reduce the relevance of a matching document that also > included the term "iowa". ( The idea is for an easier and more > discriminating way than simply increasing the relevance of all other > terms besides "iowa"). Another reply mentioned negative boosting. Is that not working as you'd like? > 3) Is there a way to handle variants of a phrase without OR'ing > together the variants? For example, I may want to find documents > dealing with North Korea; the terms might be "north korea" or "north > korean" or "north koreans" - is there a way to handle this with a > single term using wildcards? Sounds like what you're really after is fancier analysis. This is one of the purposes of analysis, to do stemming. Erik --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org