lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doron Cohen <>
Subject Re: Where to place a filter...
Date Fri, 23 Nov 2007 15:12:54 GMT
Seems your ask if to remove accents before or after stemming.
Here is a discussion on similar question (for Spanish) -

Hope this helps,

Christian Aschoff <> wrote on 22/11/2007

> Hello,
> as a prolog, i have no problems and everything works the way i want :-)
> I am more interested in a tip if i am using the right way or pattern.
> I want to strip accents before data goes into my index, so i wrote
> the code following below. I did not find an example of where to place
> a filter (for indexing) with google, so this is my guess of how to do
> it.
> My question is: Is this the correct pattern for the usage of a filter
> or where should it be placed?
> Thank you in advantage for any comments,
> Christian
>      private static final SnowballAnalyzer analyzer = new
> SnowballAnalyzer("German", GermanAnalyzer.GERMAN_STOP_WORDS);
>      public TokenStream tokenStream(String fieldName, Reader reader) {
>          return new UTF8AccentFilter(analyzer.tokenStream(fieldName,

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message