lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Erick Erickson" <erickerick...@gmail.com>
Subject Re: Query in lucene
Date Wed, 18 Jul 2007 13:16:14 GMT
When in doubt, WhitespaceAnalyzer is the most predictable. Note that
it doesn't lower-case the tokens though. Depending upon your
requirements, you can always pre-process your query and indexing
streams and do your own lowercasing and/or character stripping.

You can always create your own analyzer with the building blocks
provided via Filters and Tokenizers....

Erick.

On 7/18/07, WATHELET Thomas <thomas.wathelet@europarl.europa.eu> wrote:
>
> Witch analyser I have to use to find text like this '<commision>'?
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message