harmony-commits mailing list archives

From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] Created: (HARMONY-6640) UTF8 decoder doesn't properly decode supplementary characters
Date Thu, 02 Sep 2010 13:27:53 GMT
UTF8 decoder doesn't properly decode supplementary characters
-------------------------------------------------------------

                 Key: HARMONY-6640
                 URL: https://issues.apache.org/jira/browse/HARMONY-6640
             Project: Harmony
          Issue Type: Bug
          Components: Classlib
    Affects Versions: 5.0M14
         Environment: Windows Vista
            Reporter: Robert Muir


When attempting to build Lucene, I discovered a problem with UTF-8 decoding.
(Without a workaround, this prevents our tests from even compiling.)

For any code point > 0xFFFF (a 4-byte UTF-8 sequence), the decoder doesn't
properly split the decoded code point into a surrogate pair.
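
For reference, here is a minimal sketch of the expected behaviour: decode a
4-byte sequence to a code point, then split it into a high and a low
surrogate. The class and method names are hypothetical and are not taken from
Harmony's decoder. The sketch assumes well-formed input and does no
validation of continuation bytes or overlong forms.

```java
// Hypothetical sketch: names are illustrative, not Harmony's actual decoder code.
final class SurrogateSplitSketch {

    /** Decodes one well-formed 4-byte UTF-8 sequence starting at offset. */
    static int decodeFourByte(byte[] in, int offset) {
        // Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 payload bits).
        return ((in[offset] & 0x07) << 18)
             | ((in[offset + 1] & 0x3F) << 12)
             | ((in[offset + 2] & 0x3F) << 6)
             | (in[offset + 3] & 0x3F);
    }

    /** Splits a supplementary code point into a UTF-16 surrogate pair. */
    static char[] toSurrogatePair(int codePoint) {
        if (codePoint < 0x10000 || codePoint > 0x10FFFF) {
            throw new IllegalArgumentException("not a supplementary code point");
        }
        int v = codePoint - 0x10000;               // 20 bits remain
        char high = (char) (0xD800 + (v >> 10));   // top 10 bits
        char low = (char) (0xDC00 + (v & 0x3FF));  // bottom 10 bits
        return new char[] { high, low };
    }

    public static void main(String[] args) {
        // U+1D11E (MUSICAL SYMBOL G CLEF) encoded as UTF-8.
        byte[] utf8 = { (byte) 0xF0, (byte) 0x9D, (byte) 0x84, (byte) 0x9E };
        int cp = decodeFourByte(utf8, 0);
        char[] pair = toSurrogatePair(cp);
        // Prints: U+1D11E -> D834 DD1E
        System.out.printf("U+%X -> %04X %04X%n", cp, (int) pair[0], (int) pair[1]);
    }
}
```

The result should match what Character.toChars(int) returns for the same
code point, which makes it a convenient cross-check when testing a decoder.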


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

