[ https://issues.apache.org/jira/browse/COCOON-2063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ellis Pritchard updated COCOON-2063:
------------------------------------

    Attachment: NekoHTMLGenerator_BRANCH2_1_X.patch

This has bitten us too. Here's a patch for Cocoon 2.1.X, rev 597695.

> NekoHTMLTransformer needs to set the default-encoding of the current system to work properly with UTF-8
> -------------------------------------------------------------------------------------------------------
>
>                 Key: COCOON-2063
>                 URL: https://issues.apache.org/jira/browse/COCOON-2063
>             Project: Cocoon
>          Issue Type: Bug
>          Components: Blocks: HTML
>    Affects Versions: 2.2-dev (Current SVN)
>            Reporter: Alexander Klimetschek
>         Attachments: NekoHTMLGenerator_BRANCH2_1_X.patch, nekohtmltransformer-encoding.patch
>
>
> The NekoHTMLTransformer uses the CyberNeko HTMLConfiguration to tidy HTML. Unfortunately, it does not use the system's current encoding as the default; instead, the encoding has to be set explicitly through a property. Because the system encoding varies from one OS to another, the best solution is to set this property automatically in the NekoHTMLTransformer, based on what Java reports as its default charset:
>
>     config.setProperty("http://cyberneko.org/html/properties/default-encoding", Charset.defaultCharset().name());