cocoon-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gernot Koller <griz...@gmx.at>
Subject HTML Generator, JTidy, charencoding
Date Fri, 20 Jun 2003 15:58:28 GMT

Hi!

I'm reading a HTML page from various URLs and convert it to XHTML using 
JTidy (as in HTML Generator). I know that I can configure JTidy to use a 
certain encoding by calling setCharEncoding(Configuration.UTF8); for 
example.
My problem is, that the character encoding is very often specifyed only 
within the HTML document using tags like <meta http-equiv="content-type" 
content="text/html; charset=ISO-8859-1">.

Any tricks how to solve this problem ?

thx,

Gernot


-- 
DI Gernot Koller

---------------------------------------------------------------------
To unsubscribe, e-mail: cocoon-users-unsubscribe@xml.apache.org
For additional commands, e-mail: cocoon-users-help@xml.apache.org


Mime
View raw message