forrest-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Fabrice Bacchella <bacch...@club-internet.fr>
Subject Encoding of HTML files
Date Sun, 02 May 2004 21:47:04 GMT
By default, HTML files are encoded in ISO-8859-1 because of the way the 
Cocoon's HTML serialiser is declared in sitemap.xmap :

<map:serializer name="html" mime-type="text/html" 
src="org.apache.cocoon.serialization.HTMLSerializer">
	<doctype-public>-//W3C//DTD HTML 4.01 Transitional//EN</doctype-public>
      
<doctype-system>http://www.w3.org/TR/html4/loose.dtd</doctype-system>
</map:serializer>

The default one is such a case is ISO-8859-1. And it even add a META in 
the HTML result. But here in French, ISO-8859-15 is better because of 
the  « e dans l'o » character : œ. In Russian or Asia, the situation is 
even worse.

Is that possible to add a properties in forrest.properties saying for 
example :
project.encoding=UTF-8

and adding a <encoding>@project.encoding@</encoding> element in the 
HTML serializer ?

And in a more general way, where can I find a document about the way 
properties are used by forrest ? Which files use them ? How to add a 
new one ?

Mime
View raw message