Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 50296 invoked from network); 17 Jul 2003 13:21:36 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 17 Jul 2003 13:21:36 -0000 Received: (qmail 16475 invoked by uid 97); 17 Jul 2003 13:24:07 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 16468 invoked from network); 17 Jul 2003 13:24:06 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 17 Jul 2003 13:24:06 -0000 Received: (qmail 47401 invoked by uid 500); 17 Jul 2003 13:20:42 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 47378 invoked from network); 17 Jul 2003 13:20:42 -0000 Received: from qmail.webpipe.net (63.172.126.3) by daedalus.apache.org with SMTP; 17 Jul 2003 13:20:42 -0000 Received: (qmail 22224 invoked by uid 89); 17 Jul 2003 13:20:52 -0000 Message-ID: <20030717132052.22222.qmail@qmail.webpipe.net> From: "greg" To: lucene-user@jakarta.apache.org Subject: interesting phrase query issue Date: Thu, 17 Jul 2003 13:20:52 GMT Mime-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N I have several document sections that are being indexed via the StandardAnalyzer. One of these documents has the line "access, the manager". When searching for the phrase "access manager", this document is being returned. I understand why (at least i think i do), because a stop word is "the" and the "," is being removed by the tokenizer, my question is is there any way I can avoid having this returned in the results? My thoughts were to create a new analyzer that indexes the word "the" (blick to many of those), or index the "," in some way (also not good). Any suggestions? Thanks, Greg T Robertson --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org