Date: Thu, 10 Jun 2004 04:08:44 +0200
From: Incze Lajos
To: Lucene Users List
Subject: Re: phrase query not working in boolean clause
Message-ID: <20040610020843.GA16685@incze.adsl.enternet.hu>
References: <40C739F9.2010806@aps.org> <4CB46AAD-BA3C-11D8-A592-000393A564E6@ehatchersolutions.com>
In-Reply-To: <4CB46AAD-BA3C-11D8-A592-000393A564E6@ehatchersolutions.com>

On Wed, Jun 09, 2004 at 01:41:55PM -0400, Erik Hatcher wrote:
> On Jun 9, 2004, at 12:25 PM, Michael Duval wrote:
> >When doing an exact phrase query on the title the expected results are
> >returned:
> >
> >   +(title:"Mass Asymmetry")
> >   after tokenizing/filtering: +title:"mass asymmetri"
> >   returns 20 Hits
> >   example hit: "Mass asymmetry, equation of state, and nuclear
> >   multifragmentation"
> >
> >When attempting to confine the results to a particular journal the
> >query used is:
> >   +(journal:L) +(title:"Mass Asymmetry")
> >   after t/f : +journal:L +title:"mass asymmetri"
> >
> >   returns 315 Hits!!
> >   example hit 1: Towards dynamical mass calculations
> >   example hit 2: Up down-asymmetric gravitational fields of spinning
> >   masses
> >
> >It would seem that the search engine is treating
> >   +title:"mass asymmetri" as +title:mass asymmetri
> >
> >However, this behavior is only apparent on composite queries as shown
> >previously.
> >
> >For a sanity check I built the query using both the standard query
> >parser and the lucene search api (TermQuery, PhraseQuery,
> >BooleanQuery). The results were the same both ways.
> >
> >Is this a well known limitation of the lucene search engine? Is there
> >a different means of obtaining the desired results?
>
> Could you work up a JUnit test case example indexing a couple of
> documents like this into a RAMDirectory and a testXXX method that shows
> the failure?
>
> I cannot really make sense of what you have going on with the textual
> queries and obviously some stemming going on too. Show us the code. :)
>
> 	Erik

This was fixed about two months ago by Mr. Goller. You will need to
upgrade, if you can.

incze
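
To make the reported symptom concrete, here is a minimal sketch in plain Java (no Lucene involved) of the difference between matching the phrase "mass asymmetri" and matching only the single term "mass" on the three titles quoted above. The class name, the method names, and the toy analyzer are all hypothetical; in particular the two suffix rules are a crude stand-in for the stemming in the thread, not the Porter stemmer and not Lucene's API.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

class PhraseVsTermSketch {

    // Toy analyzer (assumption): lowercase, split on non-letters, and apply
    // two crude suffix rules so that "asymmetry" -> "asymmetri" and
    // "masses" -> "mass". This only imitates the stemming seen in the thread.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^a-z]+")) {
            if (t.isEmpty()) {
                continue; // split() yields an empty first token if text starts with a delimiter
            }
            if (t.endsWith("es")) {
                t = t.substring(0, t.length() - 2);
            } else if (t.endsWith("y")) {
                t = t.substring(0, t.length() - 1) + "i";
            }
            tokens.add(t);
        }
        return tokens;
    }

    // Phrase semantics: the analyzed phrase tokens must occur adjacent and in order.
    static boolean phraseMatch(String title, String phrase) {
        return Collections.indexOfSubList(analyze(title), analyze(phrase)) >= 0;
    }

    // Single-term semantics: the title only has to contain the one analyzed term.
    static boolean termMatch(String title, String term) {
        return analyze(title).contains(analyze(term).get(0));
    }

    public static void main(String[] args) {
        String[] titles = {
            "Mass asymmetry, equation of state, and nuclear multifragmentation",
            "Towards dynamical mass calculations",
            "Up down-asymmetric gravitational fields of spinning masses"
        };
        for (String title : titles) {
            System.out.println(title);
            System.out.println("  phrase \"Mass Asymmetry\": " + phraseMatch(title, "Mass Asymmetry"));
            System.out.println("  term   \"Mass\":           " + termMatch(title, "Mass"));
        }
    }
}
```

Under these assumptions only the first title satisfies the phrase, while all three contain the term "mass". That is consistent with the hit count growing from 20 to 315 if the phrase clause were being evaluated as a plain term clause.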