cocoon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nicola Ken Barozzi" <nicola...@apache.org>
Subject Re: NekoHTML in Cocoon....
Date Tue, 14 May 2002 17:37:48 GMT
Yes, it should be faster.

Also, I had to "patch" JTidy because it wasn't cleaning HTML correctly with
duplicate attributes.
I submitted a bug report to them, and they told me to wait the next C
version to be ported to Java.

This says it all...

--
Nicola Ken Barozzi                   nicolaken@apache.org
            - verba volant, scripta manent -
   (discussions get forgotten, just code remains)
---------------------------------------------------------------------

----- Original Message -----
From: "Jörn Heid" <heid@fh-heilbronn.de>
To: <cocoon-dev@xml.apache.org>
Sent: Tuesday, May 14, 2002 2:16 PM
Subject: AW: NekoHTML in Cocoon....


I haven't tested Tidy yet. From Sourceforge I got
"JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty
printer. Like its non-Java cousin, JTidy can be used as a tool for
cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM
parser for real-world HTML.".
So is Tidy just DOM based? If so, and Necko supports SAX, Necko should
be faster...

-----Ursprüngliche Nachricht-----
Von: Reinhard Pötz [mailto:reinhard_poetz@gmx.net]
Gesendet: Dienstag, 14. Mai 2002 14:07
An: cocoon-dev@xml.apache.org
Betreff: RE: NekoHTML in Cocoon....


What are there any differences to HTML Tidy (speed, functionality)?

Reinhard

 > -----Original Message-----
 > From: Jörn Heid [mailto:heid@fh-heilbronn.de]
 > Sent: Tuesday, May 14, 2002 1:27 PM
 > To: cocoon-dev@xml.apache.org
 > Subject: NekoHTML in Cocoon....
 >
 >
 >
 > Necko is an HTML parser based on Xerces who can parse 'normal' HTML
(not  > XHTML) and prodcues pure XML (SAX).  >
http://www.apache.org/~andyc/nekohtml/doc/index.html
 >
 > Just thinking if it could be usefull... (I haven't tried it yet ;).
>  > One possibible use case would be the ability for Cocoon developers
to  > use old html files and change them with XSLT. E.g. including news
pages  > not based on XML in Cocoon, filtering information from external
pages  > and so on.  >  > What do you think? I will ask Andy Clark for
permission if you think it  > would be usefull for Cocoon.  >  > JOERN
>  >  >
---------------------------------------------------------------------
 > To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org
 > For additional commands, email: cocoon-dev-help@xml.apache.org  >


---------------------------------------------------------------------
To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org
For additional commands, email: cocoon-dev-help@xml.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org
For additional commands, email: cocoon-dev-help@xml.apache.org




---------------------------------------------------------------------
To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org
For additional commands, email: cocoon-dev-help@xml.apache.org


Mime
View raw message