lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Volker Luedeling" <v.luedel...@brox.de>
Subject Problem with tokenizing/stemming in GermanAnalyzer
Date Mon, 17 Feb 2003 12:00:48 GMT
Hi,

my application uses a GermanAnalyzer for tokenizing a search string and
constructing Query classes:

        Analyzer an = new
org.apache.lucene.analysis.de.GermanAnalyzer();
        TokenStream ts = an.tokenStream(fieldName, new
StringReader(fieldText));

I have noticed a strange problem with capitalization. Search for
"computer" results in the token "compu". Search for "Computer", however,
results in "comput". The search is supposed to be case-insensitive, so
this must be a bug, right?

Volker


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message