cocoon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ben Fortuna (JIRA)" <j...@apache.org>
Subject [jira] [Created] (COCOON-2352) XMLEncoder doesn't support Unicode surrogate pairs
Date Fri, 19 Aug 2016 05:07:20 GMT
Ben Fortuna created COCOON-2352:
-----------------------------------

             Summary: XMLEncoder doesn't support Unicode surrogate pairs
                 Key: COCOON-2352
                 URL: https://issues.apache.org/jira/browse/COCOON-2352
             Project: Cocoon
          Issue Type: Bug
          Components: * Cocoon Core
            Reporter: Ben Fortuna


Whilst investigating an issue with the Sling project and support for emoji characters, I've
come to notice that the XMLEncoder used by HTMLSerializer doesn't support Unicode surrogate
pairs to represent higher order unicode characters.

A simple unit test that demonstrates this issue is here:

https://github.com/micronode/whistlepost/blob/master/whistlepost-rewrite-lib/src/test/groovy/org/apache/cocoon/components/serializers/encoding/XMLEncoderTest.groovy

More background info here also: SLING-5973

This seems to have been identified/addressed in other Apache projects also:

https://issues.apache.org/jira/browse/THRIFT-3403?jql=text%20~%20%22surrogate%20pairs%22





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message