commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Henri Yandell (JIRA)" <>
Subject [jira] Commented: (LANG-285) Wish : method unaccent
Date Tue, 24 Oct 2006 20:14:18 GMT
    [ ] 
Henri Yandell commented on LANG-285:

Aldrin's by-hand method does seem very painful in terms of maintenance. For example there
is nothing there for Hungarian accented characters yet or a German eszet. 

The JDK itself is only going to be of help when it can do this for us - so not much use. So
if we're going to have this method, it seems that we're pretty much going to have to build
it by hand and user submissions. The feature doesn't seem that valuable to suck up that much
time and effort - especially if JDK 6.0 will turn up with the solution.

I'm -0 on adding this to Lang. Too deep a feature to try and swallow.

As a sidenote - I wonder if we could ignore uppercase (use Character.isUpperCase, then lowercase
it, convert it and re-uppercase it on the other side if it was uppercase). 

> Wish : method unaccent
> ----------------------
>                 Key: LANG-285
>                 URL:
>             Project: Commons Lang
>          Issue Type: New Feature
>            Reporter: Guillaume Coté
>            Priority: Minor
> I would like to add a method that replace accented caracter by unaccented one.  For example,
with the input String "L'été où j'ai dû aller à l'île d'Anticosti commenca tôt", the
method would return "L'ete ou j'ai du aller à l'ile d'Anticosti commenca tot".
> I suggest to call that method unaccent and to add it in StringUtils.
> If we cannot covert all case, the first version could only covert iso-8859-1.
> If you are willing to go forward with that idea, I am willing to contribute a patch.

This message is automatically generated by JIRA.
If you think it was sent incorrectly contact one of the administrators:
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message