Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 35566 invoked from network); 5 Jul 2009 10:00:06 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 5 Jul 2009 10:00:06 -0000 Received: (qmail 2035 invoked by uid 500); 5 Jul 2009 10:00:14 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 1946 invoked by uid 500); 5 Jul 2009 10:00:14 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 1936 invoked by uid 99); 5 Jul 2009 10:00:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 05 Jul 2009 10:00:14 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [209.85.210.182] (HELO mail-yx0-f182.google.com) (209.85.210.182) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 05 Jul 2009 10:00:06 +0000 Received: by yxe12 with SMTP id 12so5436239yxe.29 for ; Sun, 05 Jul 2009 02:59:43 -0700 (PDT) MIME-Version: 1.0 Received: by 10.100.38.5 with SMTP id l5mr6040110anl.119.1246787983660; Sun, 05 Jul 2009 02:59:43 -0700 (PDT) In-Reply-To: References: Date: Sun, 5 Jul 2009 05:59:43 -0400 Message-ID: <9ac0c6aa0907050259k841ab3bj986121436678582@mail.gmail.com> Subject: Re: Boolean retrieval From: Michael McCandless To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org As of 2.9 (not yet released) the new Collector API allows you to skip scoring entirely and just collect the doc IDs matching the query. Mike On Sat, Jul 4, 2009 at 12:37 PM, Lukas Michelbacher wrote: > > This is about an experiment comparing plain Boolean retrieval with > vector-space-based retrieval. > > I would like to disable all of Lucene's scoring mechanisms and just > run a true Boolean query that returns exactly the documents that match a > query specified in Boolean syntax (OR, AND, NOT). No scoring or sorting > required. > > As far as I can see, this is not supported out of the box. =A0Which class= es > would I have to modify? > > Would it be enough to create a subclass of Similarity and to ignore all > terms but one (coord, say) and make this term return 1 if the query match= es > the document and 0 otherwise? > > Lukas > > -- > Lukas Michelbacher > Institute for Natural Language Processing > Universit=E4t Stuttgart > email: michells@ims.uni-stuttgart.de > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org