lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dawid Weiss <>
Subject Re: [jira] Updated: (LUCENE-1029) Illegal character replacements in ISOLatin1AccentFilter
Date Wed, 17 Oct 2007 06:35:00 GMT

This gets even more complicated when you throw Polish in. We do have diacritics 
(such as ó, ż, ź or ą)

but we _also_ have things like "ł" (l with a stroke):

I don't think the stroke in "ł" would qualify as a diacritic mark... to me it's 
more like a different letter.

Anyway, most Poles are _very_ comfortable with writing e-mails and querying 
search engines with stripped diacritics (and the letter ł replaced by l) even if 
this often leads to change of meaning of the original word. I guess it is so 
because typing diacritics slows you down a bit. Pragmatism.


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message