cocoon-dev mailing list archives

From "Ben Fortuna (JIRA)" <>
Subject [jira] [Issue Comment Deleted] (COCOON-2352) XMLEncoder doesn't support Unicode surrogate pairs
Date Fri, 16 Sep 2016 07:23:20 GMT


Ben Fortuna updated COCOON-2352:
    Comment: was deleted

(was: OK, I'll first create a unit test to demonstrate the issue. I'd prefer not to change
the Encoder interface, so I'll see whether it's possible to update only XMLEncoder.

I have looked at the EncodingSerializer; however, I think a surrogate pair needs to be encoded
"together", so the logic really needs to be in the delegate encoder (i.e. XMLEncoder).)
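
To illustrate what encoding a surrogate pair "together" could look like, here is a minimal
Java sketch. It is not Cocoon's XMLEncoder and does not use the Encoder interface; the class
SurrogatePairSketch and the method escapeNonAscii are hypothetical names chosen for this
example. The idea is to walk the string by code point rather than by char, so that a
high/low surrogate pair produces one numeric character reference instead of two references
to lone surrogates (which are not legal XML characters).

```java
class SurrogatePairSketch {

    // Hypothetical helper, not part of Cocoon. Escapes every non-ASCII code point
    // as a single hexadecimal character reference. Markup characters such as
    // '<' and '&' are deliberately left alone to keep the sketch short.
    static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            // codePointAt combines a valid high/low surrogate pair into one code point.
            int codePoint = text.codePointAt(i);
            if (codePoint < 0x80) {
                out.append((char) codePoint);
            } else {
                out.append("&#x")
                   .append(Integer.toHexString(codePoint).toUpperCase())
                   .append(';');
            }
            // Advance by 2 chars for a supplementary code point, otherwise by 1.
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+1F600 (an emoji) is stored in a Java String as the pair D83D DE00.
        String emoji = "\uD83D\uDE00";
        System.out.println(escapeNonAscii("a" + emoji + "b")); // prints a&#x1F600;b
    }
}
```

An encoder that handles one char at a time would instead see D83D and DE00 separately,
which is presumably why the logic has to sit where both halves of the pair are visible.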

> XMLEncoder doesn't support Unicode surrogate pairs
> --------------------------------------------------
>                 Key: COCOON-2352
>                 URL:
>             Project: Cocoon
>          Issue Type: Bug
>          Components: * Cocoon Core, Blocks: Serializers
>            Reporter: Ben Fortuna
> Whilst investigating an issue with the Sling project and support for emoji characters,
> I noticed that the XMLEncoder used by HTMLSerializer doesn't support the Unicode surrogate
> pairs that represent higher-order Unicode characters.
> A simple unit test that demonstrates this issue is here:
> More background info here also: SLING-5973
> This seems to have been identified/addressed in other Apache projects also:

This message was sent by Atlassian JIRA
