Return-Path: Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 36547 invoked from network); 11 Sep 2003 19:48:50 -0000 Received: from unknown (HELO smtp-out3.iol.cz) (194.228.2.91) by daedalus.apache.org with SMTP; 11 Sep 2003 19:48:50 -0000 Received: from fw.shark (gprs193-11.eurotel.cz [160.218.193.11]) by smtp-out3.iol.cz (Internet on Line ESMP server) with ESMTP id 2ED9B33FA9 for ; Thu, 11 Sep 2003 21:48:52 +0200 (CEST) Received: from seznam.cz (0-3.shark [192.168.0.3]) by fw.shark (8.12.8/8.12.5) with ESMTP id h8BJlXHc006512 for ; Thu, 11 Sep 2003 21:47:37 +0200 Message-ID: <3F60D040.7040401@seznam.cz> Date: Thu, 11 Sep 2003 21:42:56 +0200 From: Leo Galambos User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.4) Gecko/20030624 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Lucene Users List Subject: Re: Lucene features References: <3DC24860-E0DD-11D7-ADF3-000393A564E6@ehatchersolutions.com> <3F60B817.6040008@lucene.com> In-Reply-To: <3F60B817.6040008@lucene.com> X-Enigmail-Version: 0.76.4.0 X-Enigmail-Supports: pgp-inline, pgp-mime Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Doug Cutting wrote: > Erik Hatcher wrote: > >> Yes, you're right. Getting the scores of a second query based on the >> scores of the first query is probably not trivial, but probably >> possible with Lucene. And that combined with a QueryFilter would do >> the trick I suspect. Somehow the scores of the first query could be >> remembered and used as a boost (or other type of factor) the scores >> of the second query. > > > Why not just AND together the first and second query? That way > they're both incorporated in the ranking. Filters are good when you > don't want it to affect the ranking, and also when the first query is > a criterion that you'll reuse for many queries (e.g., > language=french), since the bit vectors can be cached (as by > QueryFilter). You probably missed the start of our discussion - we are talking about this: "q1 -> q2" which means "NOT q1 OR q2", versus "q2 -> q1" which means "q1 OR NOT q2". It causes the issue, and it also shows why you cannot use the simple "AND", because "q1 AND q2" != "NOT q1 OR q2" != "q1 OR NOT q2". Leo BTW: I didn't see the logic formulas for many years, so it is without any guarantee ;-)