xml-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Thomas B. Passin" <tpas...@mitretek.org>
Subject Re: Html To Xml
Date Mon, 29 Jan 2001 15:49:45 GMT
You don't even need the <?xml ... declaration.  On the other hand,
depending on you processor and the encoding that was used for the
document, you may have problems with the encoding (oe lack of an
encoding declaration) causing the processor to complain and quit.

You need the dtd for xhtml rather than for "html4".  Be aware that the
html dtd is very big and it may take along time for the processor to
find, load, and process all the parts.

Cheers,

Tom Passin

Christian Laflamme wrote -

> you can just add the 2 following lines at begining of the HTML file
> <?xml version='1.0'?>
> <!DOCTYPE XMI SYSTEM 'HTML4.dtd'>
>
> and change the extension to .xml
> but you also the need the DTD file defining all the tags, otherwise,
you
> won't be able to transform it with XSL files
>
> the HTML file must also be valid, which means, for example, that <P>
must
> closed with </P>, attributes must be inside quotes (<IMG
SRC="image.gif">),
> etc
>
>



Mime
View raw message