lucene-java-user mailing list archives

From Trejkaz <trej...@trypticon.org>
Subject Re: Difference in behaviour between LowerCaseFilter and String.toLowerCase()
Date Sat, 01 Dec 2012 05:32:27 GMT
On Fri, Nov 30, 2012 at 8:22 PM, Ian Lea <ian.lea@gmail.com> wrote:
> Sounds like a side effect of possibly different, locale-dependent,
> results of using String.toLowerCase() and/or Character.toLowerCase().
>
> http://docs.oracle.com/javase/6/docs/api/java/lang/String.html#toLowerCase()
> specifically mentions Turkish.
>
> A Google search for "Character.toLowerCase() turkish" gets hits which
> sound relevant.

Certainly Turkish has special rules because of the dotted uppercase
İ (U+0130). What I was really wondering is whether LowerCaseFilter
intentionally behaves differently from String.toLowerCase(), or whether
the difference is an unintentional side effect of calling
Character.toLowerCase() on each character in turn.
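
To make the question concrete, here is a minimal sketch of the two
approaches side by side. The class and method names are made up for
illustration, and perCodePoint() is only a stand-in for a filter that
lowercases one character at a time; it is not taken from
LowerCaseFilter's source.

```java
import java.util.Locale;

public class LowerCaseComparison {

    // Hypothetical stand-in for a per-character filter: lowercase each
    // code point independently with Character.toLowerCase(int).
    static String perCodePoint(String s) {
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            sb.appendCodePoint(Character.toLowerCase(cp));
            i += Character.charCount(cp);
        }
        return sb.toString();
    }

    // Whole-string lowercasing, which can apply locale-sensitive and
    // one-to-many mappings.
    static String wholeString(String s, Locale locale) {
        return s.toLowerCase(locale);
    }

    // Render a string as a list of UTF-16 code units so that combining
    // marks are visible in the output.
    static String hex(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            if (i > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", (int) s.charAt(i)));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String dottedI = "\u0130"; // LATIN CAPITAL LETTER I WITH DOT ABOVE
        Locale turkish = Locale.forLanguageTag("tr");

        System.out.println("per code point:        " + hex(perCodePoint(dottedI)));
        System.out.println("toLowerCase(ROOT):     " + hex(wholeString(dottedI, Locale.ROOT)));
        System.out.println("toLowerCase(tr):       " + hex(wholeString(dottedI, turkish)));
        System.out.println("\"I\".toLowerCase(tr):   " + hex(wholeString("I", turkish)));
    }
}
```

Going by the String.toLowerCase(Locale) documentation, the per-character
version should map U+0130 to a plain "i" (U+0069), while the
whole-string version in a non-Turkish locale should give "i" followed by
a combining dot above (U+0069 U+0307). So the two can differ even when
no Turkish locale is involved.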

TX
