lucene-java-user mailing list archives

From "Ian Lea" <>
Subject Re: PhraseQuery and non-letter characters
Date Fri, 28 Nov 2008 18:09:09 GMT
I suggest you write your own analyzer that doesn't remove non-letter
characters at index time. There might be one out there already, but
I can't think of one offhand.
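
To illustrate why keeping the non-letters helps, here is a minimal sketch
in plain Java. It does not use the Lucene API; the class and method names
(KeepPunctuationSketch, tokenize, containsPhrase) are made up for this
example, and a real solution would be a custom Analyzer/Tokenizer. The
point is only that once punctuation survives as its own token, it sits
between the two words and an exact phrase no longer lines up.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class KeepPunctuationSketch {

    /**
     * Splits text into lower-cased letter runs and keeps every other
     * non-whitespace character as a single-character token.
     * (Hypothetical helper, not a Lucene class.)
     */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder term = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                term.append(Character.toLowerCase(c));
                continue;
            }
            if (term.length() > 0) {          // end of a word
                tokens.add(term.toString());
                term.setLength(0);
            }
            if (!Character.isWhitespace(c)) { // keep punctuation as a token
                tokens.add(String.valueOf(c));
            }
        }
        if (term.length() > 0) {
            tokens.add(term.toString());
        }
        return tokens;
    }

    /** True if the phrase tokens occur contiguously in the document tokens. */
    static boolean containsPhrase(List<String> doc, List<String> phrase) {
        return Collections.indexOfSubList(doc, phrase) >= 0;
    }

    public static void main(String[] args) {
        List<String> query = tokenize("information technology");
        String[] docs = {
            "information technology",
            "information, technology",
            "information. Technology"
        };
        for (String doc : docs) {
            System.out.println(doc + " -> " + containsPhrase(tokenize(doc), query));
        }
    }
}
```

With this tokenization only the first string matches. The trade-off is
that punctuation becomes searchable terms in the index, which you may or
may not want.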

Alternatively, instead of leaving the non-letters in place, you might
have the analyzer add an extra position increment to the token that
follows removed punctuation. I think the resulting gap would prevent
exact phrase queries from matching across it.
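
A rough sketch of the position-increment idea, again in plain Java rather
than the Lucene API. Everything here (PositionGapSketch, Token, analyze,
matchesPhrase, the punctuationGap parameter) is hypothetical and only
models the behaviour: punctuation is still dropped, but the next term's
position is pushed forward, so an exact phrase (slop 0) cannot span the
gap. In Lucene the equivalent would be setting the position increment on
the token from a custom TokenFilter.

```java
import java.util.ArrayList;
import java.util.List;

public class PositionGapSketch {

    /** A term and its position in the stream (hypothetical, not Lucene's Token). */
    static final class Token {
        final String term;
        final int position;
        Token(String term, int position) {
            this.term = term;
            this.position = position;
        }
    }

    /**
     * Lower-cases letter runs and drops everything else. If any non-letter,
     * non-whitespace character was skipped since the previous term, the next
     * term's position is advanced by an extra punctuationGap.
     */
    static List<Token> analyze(String text, int punctuationGap) {
        List<Token> tokens = new ArrayList<>();
        StringBuilder term = new StringBuilder();
        int position = -1;
        boolean sawPunctuation = false;
        for (int i = 0; i <= text.length(); i++) {
            // A trailing space acts as a sentinel that flushes the last term.
            char c = i < text.length() ? text.charAt(i) : ' ';
            if (Character.isLetter(c)) {
                term.append(Character.toLowerCase(c));
                continue;
            }
            if (term.length() > 0) {
                boolean addGap = sawPunctuation && !tokens.isEmpty();
                position += 1 + (addGap ? punctuationGap : 0);
                tokens.add(new Token(term.toString(), position));
                term.setLength(0);
                sawPunctuation = false;
            }
            if (!Character.isWhitespace(c)) {
                sawPunctuation = true;
            }
        }
        return tokens;
    }

    /** Exact phrase match: terms must appear at consecutive positions. */
    static boolean matchesPhrase(List<Token> doc, String... phrase) {
        for (int start = 0; start + phrase.length <= doc.size(); start++) {
            boolean match = true;
            for (int k = 0; k < phrase.length && match; k++) {
                Token t = doc.get(start + k);
                match = t.term.equals(phrase[k])
                        && t.position == doc.get(start).position + k;
            }
            if (match) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String[] docs = {
            "information technology",
            "information, technology",
            "information. Technology"
        };
        for (String doc : docs) {
            boolean hit = matchesPhrase(analyze(doc, 1), "information", "technology");
            System.out.println(doc + " -> " + hit);
        }
    }
}
```

With a gap of 1 only the first string matches; with a gap of 0 all three
match, which is the behaviour described in the original question. Note
that a sloppy phrase query could still bridge a small gap, so the gap
size would need to exceed any slop you intend to allow.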


On Fri, Nov 28, 2008 at 5:05 PM, Ng Vinny <> wrote:
> Hi,
> I'm having an issue with PhraseQuery in which a query for the phrase
> "information technology" has among its matches the strings "information,
> technology" and "information. Technology", which should not be considered
> matches.
> Both StopAnalyzer and StandardAnalyzer remove non-letter characters at
> index time.
> Any suggestions?
> Thanks.
