lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: Mixing Case and Case-Insensitive Searching
Date Tue, 17 Apr 2007 17:48:56 GMT

: I've run into a case where we want to search for the acronym 'LET',
: however this three letter word occurs very frequently in quite a
: number of documents.
:
: What I'm looking to do is a query that's case insensitive _except_ for
: that specific term.

it sounds like you need to create a customized bastard stepshild of
StopFilter and LowercaseFilter ... take in a dictionary of known
capitalized acronimes in the constructor, then for each Token lowercase it
unless:
   - it's in all caps
   - it's in your acronym list.



-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message