xerces-j-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Richard Kelly (JIRA)" <xerces-j-...@xml.apache.org>
Subject [jira] Updated: (XERCESJ-1383) Adding Unicode Normalization support to Xerces2-J
Date Thu, 13 Aug 2009 17:09:14 GMT

     [ https://issues.apache.org/jira/browse/XERCESJ-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Richard Kelly updated XERCESJ-1383:

    Attachment: test.xml

This archive contains the Character Normalization XNI component, the ICU4J normalizer component
and patches needed to use the component.

DOM Level 3 Core: "normalize-characters" [1] 
DOM Level 3 Core: "check-character-normalization" [2] 
SAX2: "unicode-normalization-checking" [3] 

Once compiled, the following commands can be used demonstrate this component with the supplied
"test.xml" file:
java -classpath xercesSamples.jar;xercesImpl.jar;icu4j.jar sax.Counter -u test.xml
java -classpath xercesSamples.jar;xercesImpl.jar;icu4j.jar;xml-apis.jar dom.DOM3 test.xml

[1] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-normalize-characters 
[2] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-check-character-normalization

[3] http://www.saxproject.org/apidoc/org/xml/sax/package-summary.html 

> Adding Unicode Normalization support to Xerces2-J 
> --------------------------------------------------
>                 Key: XERCESJ-1383
>                 URL: https://issues.apache.org/jira/browse/XERCESJ-1383
>             Project: Xerces2-J
>          Issue Type: New Feature
>          Components: DOM (Level 3 Core), SAX
>    Affects Versions: 2.9.1
>         Environment: All
>            Reporter: Richard Kelly
>            Assignee: Michael Glavassevich
>         Attachments: CharacterNormalizer.java, CharacterNormalizer.patch, CharacterNormalizer.zip,
> This feature will add support for Unicode character normalization and normalization checking
to Xerces.  Applications that use Xerces will be able to produce fully normalized XML documents
and verify that any XML documents they process are fully normalised. 
> Adding this functionality will allow Xerces to meet the XML 1.1 W3C Recommendation regarding
character normalization and allow it to implement the optional character normalization and
normalization checking features specified in the DOM Level 3 Core and SAX2.
> More specifically, the features to be implemented are:
> DOM Level 3 Core: "normalize-characters" [1]
> DOM Level 3 Core: "check-character-normalization" [2]
> SAX2: "unicode-normalization-checking" [3]
> [1] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-normalize-characters
> [2] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-check-character-normalization
> [3] http://www.saxproject.org/apidoc/org/xml/sax/package-summary.html

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-dev-help@xerces.apache.org

View raw message