cocoon-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gernot Koller <>
Subject HTML Generator, JTidy, charencoding
Date Fri, 20 Jun 2003 15:58:28 GMT


I'm reading a HTML page from various URLs and convert it to XHTML using 
JTidy (as in HTML Generator). I know that I can configure JTidy to use a 
certain encoding by calling setCharEncoding(Configuration.UTF8); for 
My problem is, that the character encoding is very often specifyed only 
within the HTML document using tags like <meta http-equiv="content-type" 
content="text/html; charset=ISO-8859-1">.

Any tricks how to solve this problem ?



DI Gernot Koller

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message