lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject LowerCaseFilter fails one letter (I) of Turkish alphabet
Date Mon, 30 Nov 2009 19:00:12 GMT
In Turkish alphabet lowercase of I is not i. It is LATIN SMALL LETTER DOTLESS I. LowerCaseFilter
which uses Character.toLowerCase() makes mistake just for that character.

I am not sure if it is worth to add a new TokenFilter for Turkish language. I see there exist
GreekLowerCaseFilter and RussianLowerCaseFilter. It would be nice to see TurkishLowerCaseFilter
in Lucene.

Wiki recommends to ask permission from lucene committers before opening an issue. I can provide
a patch (although it is just a one line change in original LowercaseFilter) for that if you

Thank you for your consideration.



To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message