lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doron Cohen <DOR...@il.ibm.com>
Subject Re: Where to place a filter...
Date Fri, 23 Nov 2007 15:12:54 GMT
Seems your ask if to remove accents before or after stemming.
Here is a discussion on similar question (for Spanish) -

http://www.nabble.com/Snowball-and-accents-filter...--tf3653720.html#a10207399

Hope this helps,
Doron

Christian Aschoff <christian.aschoff@uni-ulm.de> wrote on 22/11/2007
21:27:20:

> Hello,
>
> as a prolog, i have no problems and everything works the way i want :-)
>
> I am more interested in a tip if i am using the right way or pattern.
> I want to strip accents before data goes into my index, so i wrote
> the code following below. I did not find an example of where to place
> a filter (for indexing) with google, so this is my guess of how to do
> it.
>
> My question is: Is this the correct pattern for the usage of a filter
> or where should it be placed?
>
> Thank you in advantage for any comments,
> Christian
...
>      private static final SnowballAnalyzer analyzer = new
> SnowballAnalyzer("German", GermanAnalyzer.GERMAN_STOP_WORDS);
...
>      public TokenStream tokenStream(String fieldName, Reader reader) {
>          return new UTF8AccentFilter(analyzer.tokenStream(fieldName,


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message