commons-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dennis Lundberg <denn...@apache.org>
Subject Re: Bug with StringEscapeUtilities' escapeHTML/unescapeHTML for certain characters
Date Sat, 18 Dec 2010 21:52:48 GMT
On 2010-12-18 09:07, Burt Leung wrote:
> Hello,
> 
> I recently used the StringEscapeUtilities to encode/decode a character
> into its equivalent HTML entity. While I haven't used it much I do
> notice that a couple cases in particular seem "wrong".
> 
> case1: StringEscapeUtils.escapeHtml4("ä")
> This appears to give "&atilde;". This should actually be "&auml;".
> 
> case2: StringEscapeUtils.escapeHtml4("å");
> This gives "&aring" but should actually be "&atilde;".

This one is correct "å" should give "&aring;"

> Using the unescape functional also gives the (incorrect) reverse results.
> 
> Is this an actual bug or am I missing something?
> 
> Thanks,
> Burt
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@commons.apache.org
> For additional commands, e-mail: user-help@commons.apache.org
> 
> 


-- 
Dennis Lundberg

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@commons.apache.org
For additional commands, e-mail: user-help@commons.apache.org


Mime
View raw message