cocoon-users mailing list archives

From Joerg Heinicke <>
Subject Re: Encoding and thorn
Date Thu, 27 Mar 2003 08:06:56 GMT
Hello Tim,

Tim Cavanagh wrote:
> Hi,
> I am putting some text from MS Word through our publishing system that
> includes MS curly quotes (8-bit). When it gets to Cocoon to be parsed via
> XSLT, I get an error. The error is caused by the thorn element being
> placed in the text (unclosed). Our publishing system is on a Mac and uses
> Mac Roman encoding, if this is an issue.

No, this should not be an issue.

> The input is:
> Tim¹s ³good² day
> The output is:
> Tim”¥ús ”¥<thorn>good”¥ÿ day

The only problem in the above is the <, so you must preprocess the
document and replace < with &lt;. The same goes for &, which must
become &amp;.
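
A minimal sketch of that preprocessing step, assuming a Python script can be run on the text before it reaches Cocoon. The function name escape_for_xml is made up for illustration and is not part of Cocoon or of your publishing system. Note that & has to be replaced first, otherwise the & of a freshly written &lt; would be escaped a second time.

```python
def escape_for_xml(text: str) -> str:
    """Escape the two characters that break XML text content.

    Hypothetical helper for illustration only.
    """
    # Replace & first, so the & in the &lt; produced next is left alone.
    text = text.replace("&", "&amp;")
    # Then make every literal < harmless for the XML parser.
    return text.replace("<", "&lt;")


if __name__ == "__main__":
    print(escape_for_xml("Tim's <thorn>good day & night"))
```

For the sample input in the script this prints Tim's &lt;thorn>good day &amp; night, so the parser no longer sees an unclosed thorn element.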

How is your publishing system converting the Word file into XML?

> Any ideas?

I don't think you have any other option.



> Cheers
> Tim Cavanagh
> South Australia
